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Abstract 

Cn , In this paper we present and evaluate a general framework for the design of truthful 

auctions for matching agents in a dynamic, two-sided market. A single commodity, such 
^-H ■ as a resource or a task, is bought and sold by multiple buyers and sellers that arrive 

' and depart over time. Our algorithm. Chain, provides the first framework that allows a 

• . truthful dynamic double auction (DA) to be constructed from a truthful, single-period (i.e. 

^ ' static) double- auction rule. The pricing and matching method of the Chain construction 

is unique amongst dynamic-auction rules that adopt the same building block. We examine 
experimentally the allocative efficiency of Chain when instantiated on various single-period 
^ . rules, including the canonical McAfee double-auction rule. For a baseline we also consider 

\Q ' non-truthful double auctions populated with "zero-intelligence plus" -style learning agents. 

. CuAiN-based auctions perform well in comparison with other schemes, especially as arrival 

' intensity falls and agent valuations become more volatile. 

1— I ■ 

1. Introduction 

Electronic markets are increasingly popular as a method to facilitate increased efficiency 
in the supply chain, with firms using markets to procure goods and services. Two-sided 
markets facilitate trade between many buyers and many sellers and find application to 
;h ' trading diverse resources, including bandwidth, securities and pollution rights. Recent 

^ years have also brought increased attention to resource allocation in the context of on- 

demand computing and grid computing. Even within settings of cooperative coordination, 
such as those of multiple robots, researchers have turned to auctions as methods for task 
allocation and joint exploration (Gerkey & Mataric, 2002; Lagoudakis et ah, 2005; Lin &: 
Zheng, 2005). 

In this paper we consider a dynamic two-sided market for a single commodity, for in- 
stance a unit of a resource (e.g. time on a computer, some quantity of memory chips) or 
a task to perform (e.g. a standard database query to execute, a location to visit). Each 
agent, whether buyer or seller, arrives dynamically and needs to be matched within a time 
interval. Cast as a task-allocation problem, a seller can perform the task when allocated 
within some time interval and incurs a cost when assigned. A buyer has positive value for 
the task being assigned (to any seller) within some time interval. The arrival time, accept- 
able time interval, and value (negative for a seller) for a trade are all private information 
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to an agent. Agents are self-interested and can choose to misrepresent all and any of this 
information to the market in order to obtain a more desirable price. 

The matching problem combines elements of online algorithms and sequential decision 
making with considerations from mechanism design. Unlike traditional sequential decision 
making, a protocol for this problem must provide incentives for agents to report truthful 
information to a match-maker. Unlike traditional mechanism design, this is a dynamic 
problem with agents that arrive and leave over time. We model this problem as a dynamic 
double auction (DA) for identical items. The match-maker becomes the auctioneer. Each 
seller brings a task to be performed during a time window and each buyer brings the 
capability to perform a single task. The double-auction setting also is of interest in its own 
right as a protocol for matching in a dynamic business-to-business exchange. 

Uncertainty about the future coupled with the two-sided nature of the market leads to 
an interesting mechanism design problem. For example, consider the scenario where the 
auctioneer must decide how (and whether) to match a seller with reported cost of $6 at the 
end of its time interval with a present and unmatched buyer, one of which has a reported 
value of $8 and one a reported value of $9. Should the auctioneer pair the higher bidder 
with the seller? What happens if a seller, willing to sell for $4, arrives after the auctioneer 
acts upon the matching decision? How should the matching algorithm be designed so that 
no agent can benefit from misstating its earliest arrival, latest departure, or value for a 
trade? 

Chain provides a general framework that allows a truthful dynamic double auction to 
be constructed from a truthful, single-period (i.e. static) double-auction rule. The auctions 
constructed by Chain arc truthful, in the sense that the dominant strategy for an agent, 
whatever the future auction dynamics and bids from other agents, is to report its true 
value for a trade (negative if selling) and true patience (maximal tolerance for trade delay) 
immediately upon arrival into the market. We also allow for randomized mechanisms and, 
in this case, require strong truthfulness: the DA should be truthful for all possible random 
coin flips of the mechanism. One of the DAs in the class of auctions implied by Chain 
is a dynamic generalization of McAfee's (1992) canonical truthful, no-deficit auction for a 
single period. Thus, we provide the first examples of truthful, dynamic DAs that allow for 
dynamic price competition between buyers and sellers.^ 

The main technical challenge presented by dynamic DAs is to provide truthfulness with- 
out incurring a budget deficit, while handling uncertainty about future trade opportunities. 
Of particular concern is to ensure that an agent does not indirectly affect its price through 
the effect of its bid on the prices faced by other agents and thus other supply and demand 
in the market. We need to preclude this because the availability of trades depends on the 
price faced by other agents. For example, a buyer that is required to pay $4 in the DA to 
trade might like to decrease the price that a potentially matching seller will receive from $6 
to $3 to allow for trade. 

Chain is a modular approach to auction design, which takes as a building block a single- 
period matching rule and provides a method to invoke the rule in each of multiple periods 
while also providing for truthfulness. We characterize properties that a well-defined single- 

1. The closest work in the literature is due to Blum et al. (2006), who present a truthful, dynamic DA for 

our model that matches bids and asks based on a price sampled from some bid-independent distribution. 
We compare the performance of our schemes with this scheme in Section 6. 
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period matching rule must satisfy in order for Chain to be truthful. We further identify the 
technical property of strong no-trade, with which we can isolate agents that fail to trade in 
the current period but can nevertheless survive and be eligible to trade in a future period. 
An auction designer defines the strong no-trade predicate, in addition to providing a well- 
defined single-period matching rule. Instances within this class include those constructed 
in terms of both "price-based" matching rules and "competition-based" matching rules. 
Both can depend on history and be adaptive, but only the competition-based rules use the 
active bids and asks to determine the prices in the current period, facilitating a more direct 
competitive processes. 

In proving that Chain, when combined with a well-defined matching rule and a valid 

strong no-trade predicate, is truthful we leverage a recent price-based characterization for 
truthful online mechanisms (Hajiaghayi et al., 2005). We also show that the pricing and 
matching rules defined by Chain are unique amongst the family of mechanisms that are 
constructed with a single-period matching rule as a building block. Throughout our work we 
assume that a constant limits every buyer and seller's patience. To motivate this assumption 
we provide a simple environment in which no truthful, no-deficit DA can implement some 
constant fraction of the number of the efficient trades, for any constant. 

We adopt allocative efficiency as our design objective, which is to say auction protocols 
that maximize the expected total value from the sequence of trades. We also consider net 
efficiency, wherein any net outflow of payments to the marketmaker is also accounted for in 
considering the quality of a design. Experimental results explore the allocative efficiency of 
Chain when instantiated to various single-period matching rules and for a range of different 
assumptions about market volatility and maximal patience. For a baseline we consider the 
efficiency of a standard (non-truthful) open outcry DA populated with simple adaptive 
trading agents modeled after "zero-intelligence plus" (ZIP) agents (Cliff & Bruten, 1998; 
Preist &L van Tol, 1998). We also compare the efficiency of Chain with that of a truthful 
online DA due to Blum et al. (2006), which selects a fixed trading price to guarantee 
competitiveness in an adversarial model. 

Prom within the truthful mechanisms we find that adaptive, price-based instantiations 

of Chain arc the most effective for high arrival intensity and low volatility. Even defining 
a single, wcll-choscn price that is optimized for the market conditions can be reasonably 
effective in promoting efficient trades in low volatility environments. On the other hand, 
for medium to low arrival intensity and medium to high volatility we find that the Chain- 
based DAs that allow for dynamic price competition, such as the McAfee-based rule, are 
most efficient. The same qualitative observations hold whether one is interested in alloca- 
tive efficiency or net efficiency, although the adaptive, price-based methods have better 
performance in terms of net efficiency. The Blum et al. (2006) rule fairs poorly in our 
tests, which is perhaps unsurprising given that it is optimized for worst-case performance 
in an adversarial setting. When populated with ZIP agents, we find that non-truthful DAs 
can provide very good efficiency in low volatility environments but poor performance in 
high volatility environments. The good performance of the ZIP-based market occurs when 
agents learn to bid approximately truthfully; i.e., when the market operates as if truthful, 
but without incurring the stringent cost (e.g., through trading constraints) of imposing 
truthfulness explicitly. An equilibrium analysis is available only for the truthful DAs; we 
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have no way of knowing how close the ZIP agents are to playing an equilibrium, and note 
that the ZIP agents do not even consider performing time-based manipulations. 

1.1 Outline 

Section 2 introduces the dynamic DA model, including our assumptions, and presents 
desiderata for online DAs and a price-based characterization for the design of truthful 
dynamic auctions. Section 3 defines the Chain algorithm together with the building block 
of a well-defined, single-period matching rule and the strong no-trade predicate. Section 4 
gives a number of instantiations to both price-based and competition-based matching rules, 
including a general method to define the strong no-trade predicate given a price-based in- 
stantiation. Section 5 proves truthfulness, no-deficit and feasibility of the Chain auctions 
and also establishes their uniqueness amongst auctions constructed from the same single- 
period matching-rule building block. The importance of the assumption about maximal 
agent patience is established. Section 6 presents our empirical analysis, including a descrip- 
tion of the simple adaptive agents that we use to populate a non-truthful open-outcry DA 
and provide a benchmark. Section 7 gives related work. In Section 8 we conclude with a 
discussion about the merits of truthfulness in markets and present possible extensions. 

2. Preliminciries: Basic Definitions 

Consider a dynamic auction model with discrete, possibly infinite, time periods T = 
{1, 2, . . .}, indexed by t. The double auction (DA) provides a market for a single commodity. 
Agents are either buyers or sellers interested in trading a single unit of the commodity. An 
agent's type, 9i = {ai,di,iVi) E Qi, where Qi is the set of possible types for agent i, defines 
an arrival a^, departure di, and value itij G M for trade. If the agent is a buyer, then Wi > 0. 
If the agent is a seller, then 'u;^ < 0. We assume a maximal patience K, so that di < ai + K 
for all agents. 

The arrival time models the first time at which an agent learns about the market or 
learns about its value for a trade. Thus, information about its type is not available before 
period (not even to agent i) and the agent cannot engage in trade before period a,. The 
departure time, di, models the final period in which a buyer has positive value for a trade, 
or the final period in which a seller is willing to engage in trade. We model risk-neutral 
agents with quasi- linear utility, Wi — p when a trade occurs in i G [ai,di] and payment p 
is collected (with ^? < if the agent is a seller). Agents are rational and self-interested, 
and act to maximize expected utility. By assumption, sellers have no utility for payments 
received after their true departure period. 

Throughout this paper we adopt bid to refer, generically, to a claim that an agent - 
either a buyer or a seller - makes to a DA about its type. In addition, when we need to be 
specific about the distinction between claims made by buyers and claims made by sellers 
we refer to the bid from a buyer and the ask from a seller. 

2.1 Example 

Consider the following naive generalization of the (static) trade-reduction DA (Lavi & Nisan, 
2005; McAfee, 1992) to this dynamic environment. A bid from an agent is a claim about 
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its type 6i = {ai,di,Wi), necessarily made in period t = hi. Bids are active while t G [aj,(ij] 
and no trade has occurred. 

Then in each period use the trade-reduction DA to determine which (if any) of the 
active bids trade and at what price. These trades occur immediately. The trade-reduction 
DA (tr-DA) works as follows: Let B denote the set of bids and S denote the set of asks. 
Insert a dummy bid with value +00 into B and a dummy ask with value into S. When 
|S| > 2 and jS"! > 2 then sort B and S in order of decreasing value. Let W},^ > Wf,^ > ■ ■ ■ and 
> > . . . denote the bid and ask values with {bQ,SQ) denoting the dummy bid-ask 
pair. Let m > index the last pair of bids and asks to clear in the efficient trade, such that 
Wbm + ^ ^i^d ii'b^+i + w's^+i < 0. When m > 2 then bids {61, ... , bm-i} and asks 
{si, . . . , Sm-i} trade and payment Wf,^ is collected from each winning buyer and payment 
—Wg^ is made to each winning seller. 

First consider a static tr-DA with the following bids and asks: 



i 


B 

UH 


i 


s 

Wi 


K 


15 




-1 


b*2 


10 


^2 


-1 


b*3 


4 


t* 


-2 


64 


3 


S4 


-2 




S5 


-5 



The line indicates that bids (1 4) and asks (1-4) could be matched for efficient trade. 
By the rules of the tr-DA, bids (1-3) and asks (1-3) trade, with payments $3 collected from 
winning buyers and payment $2 made to winning sellers. The auctioneer earns a profit of 
$3. The asterisk notation indicates the bids and asks that trade. The tr-DA is truthful, in 
the sense that it is a dominant-strategy for every agent to report its true value whatever 
the reports of other agents. For intuition, consider the buy-side. The payment made by 
winners is independent of their bid price while the losing bidder could only win by bidding 
more than $4, at which point his payment would be $4 and more than his true value. 

Now consider a dynamic variation with buyer types {(1, 2, 15), (1, 2, 10), (1, 2, 4), (2, 2, 3)} 
and seller types {(1, 2, -1), (2, 2, -1), (1, 1, -2), (2, 2, -2), (1, 2, -5)}. When agents are truth- 
ful, the dynamic tr-DA plays out as follows: 



period 1 




period 2 


B 


S 




B 
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i Wi 


i Wi 


i 


Wi 


i Wi 


bl 15 


4 -1 


b*2 


10 


S*2 -1 


62 10 


S3 -2 


b3 


4 


S4 -2 


63 4 


S5 -5 


bi 


3 


S5 -5 



In period 1, buyer 1 and seller 1 trade at payments of $10 and $2 respectively. In 
period 2, buyer 2 and seller 2 trade at payments of $4 and $2 respectively. But now we can 
construct two kinds of manipulation to show that this dynamic DA is not truthful. First, 
buyer 1 can do better by delaying his reported arrival until period 2: 
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period 1 


period 2 
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i Wi 


i Wi 


i Wi 


i Wi 


hi 10 
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bl 15 


S*2 -1 


63 4 


S3 -2 


63 4 


S4 -2 




S5 -5 


64 3 


S5 -5 



Now, buyer 2 trades in period 1 and does not set the price to buyer 1 in period 2. Instead, 
buyer 1 now trades in period 2 and makes payment $4. 

Second, buyer 3 can do better by increasing his reported value: 
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period 2 





B 




s 




B 




S 


i 


■Wi 


i 


Wi 


i 


Wi 


i 


Wi 


h\ 


15 


Si 


-1 




6 


^2 


-1 




10 


*3 


-2 


64 


3 


S4 


-2 




6 


S5 


-5 






S5 


-5 



Now, buyers 1 and 2 botli trade in period 1 and this allows buyer 3 to win (at a price 
below his true value) in period 2. This is a particularly interesting manipulation because 
the agent's manipulation is by increasing its bid above its true value. By doing so, it allows 
more trades to occur and makes the auction less competitive in the next period. 

2.2 Dynamic Double Auctions: Desiderata 

We consider only direct-revelation, dynamic DAs that restrict the message that an agent 
can send to the auctioneer to a single, direct claim about its type. We also consider "closed" 
auctions so that an agent receives no feedback before reporting its type and cannot condition 
its strategy on the report of another agent. ^ 

Given this, let 0* denote the set of agent types reported in period t,9 = {9^ ,6'^ , ... ,9*, ... ,] 
denote a complete type profile (perhaps unbounded), and 0-* denote the type profile re- 
stricted to agents with (reported) arrival no later than period t. A report 9i = {ai,di,Wi) 
represents a commitment to buy (sell) one unit of the commodity in any period t € [ai,c?j] 
for a payment of at most Wi. Thus, if a seller reports a departure time di > di, it must 
commit to complete a trade that occurs after her true departure and even though a seller 
is modeled as having no utility for payments received after her true departure. 

A dynamic DA, M = {Tr,x), defines an allocation policy tt = {vr*}*^^ and payment 
policy X = {.x*}*^'^, where tt\{6-*) G {0, 1} indicates whether or not agent i trades in period 
t given reports 6*-*, and x\{9-^) G M indicates a payment made by agent i, negative if this is 
a payment received by the agent. The auction rules can also be stochastic, so that 7r*(^-*) 
and x\{9-^) are random variables. For a dynamic DA to be well defined, it must hold that 



TT- 



1 in at most one period t € [oj, di] and zero otherwise, and the payment collected 
from agent i is zero except in periods t G [oj , di] . 

In formalizing the desiderata for dynamic DAs, it will be convenient to adopt {'k{9),x{9)) 
to denote the complete sequence of allocation decisions given reports 9, with shorthand 



2. The restriction to direct-revelation, online mechanisms is without loss of generality when combined with 

a simple heart-beat message from an agent to indicate its presence in any period t during its reported 
arrival-departure interval. See the work of Pai and Vohra (2006) and Parkes (2007). 
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Tri{0) G {0, 1} and Xi{d) € M to indicate whether agent i trades during its reported arrival- 
departure interval, and the total payment made by agent i, respectively. By a slight abuse 
of notation, we write i G 9-^ to denote that agent i reported a type no later than period t. 
Let B denote the set of buyers and S denote the set of sellers. 

We shall require that the dynamic DA satisfies no-deficit, feasibility, individual-rationality 
and truthfulness. No-deficit ensures that the auctioneer has a cash surplus in every period: 

Definition 1 (no-deficit) A dynamic DA, M = (tt, x) is no-deficit if: 

J2 H ^f(^-*')>o, \/t,\/e (1) 

ie6>^*t'G[ai,min{Mi)] 

Feasibility ensures that the auctioneer does not need to take a short position in the 
commodity traded in the market in any period: 

Definition 2 (feasible trade) A dynamic DA, M = (tt, x) is feasible if: 

E E 4(0^')- E E 4{0^')>0, W,V0 (2) 

ieOi*,ieS t'e[ai ,min(t,di)] ieO^* ,ieB t' e[ai ,min{t,di)] 

This definition of feasible trade assumes that the auctioneer can "hold" an item that 
is matched between a seller-buyer pair, for instance only releasing it to the buyer upon 
his reported departure. See the remark concluding this section for a discussion of this 
assumption. 

Let Vi{6i,TT{6'-,6^i)) G M denote the value of an agent with type 9i for the allocation 
decision made by policy tt given report (9'-,9^i), i.e. Vi{9i,TT(9'-,9^i)) = wi if the agent 
trades in period t G [a^, dj] and if it trades outside of this interval and is a buyer, or — oo if 
it trades outside of this interval and is a seller. Individual-rationality requires that agent Vs 
utility is non-negative when it reports its true type, whatever the reports of other agents: 

Definition 3 (individual-rational) A dynamic DA, M = {it,x) is individual-rational 
(IR) ifvi{9i,Tr{9)) - Xi{9) > for all i, all 9. 

In order to define truthfulness, we introduce notation C{9i) C 0j for 9i G 0^ to denote 
the set of available misreports to an agent with true type 9i. In the standard model adopted 
in offline mechanism design, it is typical to assume C{9i) = 0j with all misreports available. 
Here, we shall assume no early-arrival misreports, with C{9i) = {9i = (oj, di,Wi) : ai < ai < 
di\. This assumption of limited misreports is adopted in earlier work on online mechanism 
design (Hajiaghayi et al., 2004), and is well-motivated when the arrival time is the first 
period in which a buyer first decides to acquire an item or the period in which a seller first 
decides to sell an item. 

Definition 4 (truthful) Dynamic DA, Al = (vr, x), is dominant- strategy incentive- 
compatible, or truthful, given limited misreports C if: 

Vi{9i,TT{9i,9'_i))-Xi{9i,9'_i) > Vi{9i,Tr{9i,9'_i))-Xi{9i,9'_i). 

for all 9i G C{9i), all 9i, all 9'_^ G C{9-i), all 9-i G Q-i. 
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This is a robust equilibrium concept: an agent maximizes its utility by reporting its 
true type whatever the reports of other agents. Truthfulness is useful because it simplifies 
the decision problem facing bidders: an agent can determine its optimal bidding strategy 
without a model of either the auction dynamics or the other agents. In the case that 
the allocation and payment policy is stochastic, then we adopt the requirement of strong 
truthfulness so that an agent maximizes its utility whatever the random sequence of coin 
flips within the auction. 

Remark. The flexible definition of feasibility, in which the auctioneer is able to take a 
long position in the commodity, allows the auctioneer to time trades by receiving the unit 
sold by a seller in one period but only releasing it to a buyer in a later period. This allows 
for truthfulness in environments in which bidders can overstate their departure period. In 
some settings this is an unreasonable requirement, however, for instance when the com- 
modity represents a task that is performed, or because a physical good is being traded 
in an electronic market.^ In these cases, the definition of feasibility strengthened to re- 
quire exact trade-balance in every period. The tradeoff is that available misreports must 
be further restricted, with agents limited to reporting no late- departures in addition to no 
early- arrivals (Lavi & Nisan, 2005; Hajiaghayi et al., 2005). For the rest of the paper we 
work in the "relaxed feasibility, no early-arrival" model. The Chain framework can be im- 
mediately extended to the "strong-feasibility, no early-arrival and no late-departure" model 
by executing trades immediately rather than delaying the trade until a buyer's departure. 

3. Chain: A Framework for Truthful Dynamic DAs 

Chain provides a general algorithmic framework with which to construct truthful dynamic 
DAs from well-defined single-period matching rules, such as the tr-DA rules described in 

the earlier section. 

Before introducing Chain we need a few more definitions: Bids reported to Chain are 
active while t < di (for reported departure period dj), and while the bid is unmatched 
and still eligible to be matched. In each period, a single-period matching rule is used to 
determine whether any of the active bids will trade and also which (if any) of the bids that 
do not match will remain active in the next period. 

Now we define the building blocks, well-defined single-period matching rules, and intro- 
duce the important concept of a strong no-trade predicate, which is defined for a single- 
period matching rule. 

3.1 Building Block: A Single- Period Matching Rule 

In defining a matching rule, it is helpful to adopt 6* G M^q and s* G M<o to denote the 
active bids and active asks in period t, where there are m > and n > bids and asks 
respectively. The bids and asks that were active in earlier periods but are no longer active 
form the history in period t, denoted iJ* G M.^ where h > is the size of the history. 

A single-period matching rule (hereafter a matching rule), Mmr = (T^mnXmr) defines 
an allocation rule 7r^r.(i?*, 6*, s*, w) G {0,1}'^"^'^"') and a payment rule x^r(-f^*) ■s*,a;) G 

3. Note that if the task is a computational task, then tasks can be handled within this model by requiring 
that the seller performs the task when it is matched but with a commitment to hold onto the result until 
the matched buyer is ready to depart. 



140 



Chain: An Online Double Auction 



function SiMPLEMATCH(if*,6*,s*) 
matched := 
39* := mean(|ff*|) 
while (&* ^ 0)&(s* 7^ 0) do 

i := 0, bi := — e, j := 0, := —00 
while {bi < p*)k{b* ^ 0) do 

i random(6*), 6* := 6* \ {i} 
end while 

while {sj < -p*)k{s^ ^ 0) do 

i := random(s*), s* := s* \ {j} 
end while 

if [i 0)&(j 7^ 0) then 

matched := matched U 
end if 
end while 
end function 

Figure 1: A well-defined matching rule defined in terms of the mean bid price in the history. 

]^(m+n) jj(,j.g^ include random event a; G J7 to allow explicitly for stochastic matching 
and allocation rules. 

Definition 5 (well-defined matching rule) A niMtching rule M^r = {'^mr,Xmr) is well- 
defined when it is strongly truthful, no-deficit, individual-rational, and strong -feasible. 

Here, the properties of truthfulness, no-deficit, and individual-rationality are exactly 
the single-period specializations of those defined in the previous section. For instance, 
a matching rule is truthful in this sense when the dominant strategy for an agent in a 
DA defined with this rule, and in a static environment, is to bid truthfully and for all 
possible random events uj. Similarly for individual-rationality. No-deficit requires that the 
total payments are always non-negative. Strong-feasibility requires that exactly the same 
number of asks are accepted as bids, again for all random events. 

One example of a well-defined matching rule is the tr-DA, which is invariant to the 
history of bids and asks. For an example of a well-defined, adaptive (history-dependent) 
and price-based matching rule, consider procedure SimpleMatch in Figure 1. The SlM- 
pleMatch matching rule computes the mean of the absolute value of the bids and asks in 
the history if* and adopts this as the clearing price in the current period. It is a stochastic 
matching rule because bids and asks are picked from the sets 6* and s* at random and 
offered the price. We can reason about the properties of SimpleMatch as follows: 

(a) truthful: the price is independent of the bids and the probability that a bid (or 
ask) is matched is independent of its bid (or ask) price 

(b) no-deficit: payment is collected from each matched buyer and made to each 

matched seller 

(c) individual-rational: only bids 6j > and asks sj > — p* are accepted. 

(d) feasible: bids and asks are introduced to the "matched" set in balanced pairs 
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3.2 Reasoning about Trade (Im) Possibility 

In addition to defining a matching rule M^j., we allow a designer to (optionally) designate 
a subset of losing bids that satisfy a property of strong no-trade. Bids that satisfy strong 
no-trade are losing bids for which trade was not possible at any bid price (c.f. ask price 
for asks), and moreover for which additional independence conditions hold between bids 
provided with this designation. 

We first define the weaker concept of no-trade. In the following, notation 
Trmr,i{H*,b^,s^,(^\wi) indicates the allocation decision made for bid (or ask) i when its bid 
(ask) price is replaced with wf. 

Definition 6 (no-trade) Given matching rule M^r = {j^mriXmr) then the set of agents, 
NT*, for which no trade is possible in period t and given random events u are those for 
which TTmr,i{H* -1^^ 1 ,oj\wi) = 0, for every wi G R>o when i E and for every wi G M<o 
when i E s*. 

It can easily happen that no trade is possible, for instance when the agent is a buyer 
and there are no sellers on the other side of the market. Let SNT* C NT* denote the set 
of agents designated with the property of strong no-trade. Unlike the no-trade property, 
strong no-trade need not be uniquely defined for a matching rule. To be valid, however, the 
construction offered by a designer for strong no-trade must satisfy the following: 

Definition 7 (strong no-trade) A construction for strong no-trade, SNT* C NT*, is 
valid for a matching rule when: 

(a) yi E NT* with di > t, whether or not i G SNT* is unchanged for all alternate reports 
e[ = {a[,d[,w[)^ei while d[>t, 

(b) Vf G SNT* with di > t, the set {j : j G SNT*,j 7^ i,dj > t} is unchanged for all 
reports 9'- = {a'-, d[,w[) 7^ Oi while > t, and independent even of whether or not agent i is 
present in the market. 

The strong no-trade conditions must be checked only for agents with a reported depar- 
ture later than the current period. Condition (a) requires that such an agent in NT* cannot 
affect whether or not it satisfies the strong no-trade predicate as long as it contimies to 
report a departure later than the current period. Condition (b) is defined recursively, and 
requires that if such an agent is identified as satisfying strong no-trade, then its own report 
must not affect the designation of strong no-trade to other agents, with reported departure 
later than the current period, while it continues to report a departure later than the current 
period - even if it delays its reported arrival until a later period. 

Strong no-trade allows for flexibility in determining whether or not a bid is eligible for 
matching. Specifically, only those bids that satisfy strong no-trade amongst those that lose 
in the current period can remain as a candidate for trade in a future period. The property 
is defined to ensure that such a "surviving" agent does not, and could not, affect the set of 
other agents against which it competes in future periods. 
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Example 1 Consider the tr-DA matching rule defined earlier with bids and asks 
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Bid 1 and ask 1 trade at price 8 and —6 respectively. NT* = because bids 2 and 3 could 
each trade if they had (unilaterally) submitted a bid price of greater than 10. Similarly for 
asks 2 and 3. Now consider the order book 
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No trade occurs. In this case, NT* = {61,62,63,51}. No trade is possible for any bids, even 

bids 2 and 3, because Wb-^ +'Ws2 = 8 — 10 < 0. But, trade is possible for asks 2 and 3, because 
Wb^ + Wsi = 7 — 6 > and either ask could trade by submitting a low enough ask price. 

Example 2 Consider the tr-DA matching rule and explore possible alternative construc- 
tions for strong no-trade. 

(i) Dictatorial: in each period t, identify an agent that could be present in the period in 
a way that is oblivious to all agent reports. Let i denote the index of this agent. If i & NT*, 
then include SNT* = {z}. Strong no-trade condition (a) is satisfied because whether or not i 
is selected as the "dictator" is agent-independent, and given that it is selected, then whether 
or not trade is possible is agent-independent. Condition (b) is trivially satisfied because 
I SNT* I = 1 and there is no cross-agent coupling to consider. 

(a) SNT* := NT*. Consider the order book 
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Suppose all bids and asks remain in the market for at least one more period. Clearly, 
NT* = {61,62,63,81,52,^3}. Consider the candidate construction SNT* = NT*. Strong no- 
trade condition (a) is satisfied because whether or not i is in set NT* is agent-independent. 
Condition (b) is not satisfied, however. Consider bid 2. If bid 2's report had been 8 instead 
of 2 then trade would be possible for bids 1 and 3, and SNT* = NT* = {62, si, S2, S3}. Thus, 
whether or not bids 1 and 3 satisfy the strong no-trade predicate depends on the value of bid 
2. This is not a valid construction for strong no-trade for the tr-DA matching rule. 

(Hi) SNT* = NT* if |6*| < 2 or |s*| < 2, and SNT* = otherwise. As above, strong 
no-trade condition (a) is immediately satisfied. Moreover, condition (b) is now satisfied 
because trade is not possible for any bid or ask irrespective of bid values because there are 
simply not enough bids or asks to allow for trade with tr-DA (which needs at least 2 bids 
and at least 2 asks). 
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Figure 2: The decision process in Chain upon arrival of a new bid. If admitted, then the bid 

participates in a sequence of matching events while it remains unmatched and in the 
strong no-trade set. The bid matches at the first available opportunity and is priced 
immediately. 



Example 3 Consider a variant of the SimpleMatch matching rule, defined with fixed 
price 9. We can again ask whether SNT* := NT* is a valid construction for strong no-trade. 
Throughout this example suppose all bids and asks remain in the market for at least one 
more period. First consider a bid with = 8 and two asks with values Wgi = —6 and 

= —7. Here, NT* = {.si,S2} because the asks cannot trade whatever their price since 
the bid is not high enough to meet the fixed trading price of 9. Moreover, SNT* = {si,S2} 
is a valid construction; strong no-trade condition (a) is satisfied as above and condition (b) 
is satisfied because whether or not ask 2 is in NT* ( and thus SNT* j is independent of the 
price on ask 1, and vice versa. But consider instead a bid with = 8 and an ask with 

= —10. Now, NT* = {6i,si} and SNT* = {5i,si} is our candidate strong no-trade set. 
However if bid 1 had declared value 10 instead of 8 then NT* = {61} and ask 1 drops out of 
SNT*. Thus, strong no-trade condition (b) is not satisfied. 

We see from the above examples that it can be quite delicate to provide a valid, non- 
trivial construction of strong no-trade. Note, however, that SNT* = is a (trivial) valid 
construction for any matching rule. Note also that the strong no-trade conditions (a) and 
(b) require information about the reported departure period of a bid. Thus, while the 
matching rules do not use temporal information about bids, this information is used in the 
construction for strong no-trade. 

3.3 Chain: From Matching Rules to Truthful, Dynamic DAs 

The control flow in CHAIN is illustrated in Figure 2. Upon arrival of a new bid, an admission 
decision is made and bid i is admitted if its value Wi is at least its admission price qi. An 
admitted bid competes in a sequence of matching events, where a matching event simply 
applies the matching rule to the set of active bids and asks. If a bid fails to match in some 
period and is not in the strong no-trade set (i ^ SNT*), then it is priced out and leaves the 
market without trading. Otherwise, if it is still before its departure time {t <di), then it is 
available for matching in the next period. 

Each bid is always in one of three states: active, matched or priced-out. Bids are active 
if they are admitted to the market until i < dj, or until they are matched or priced-out. An 



144 



Chain: An Online Double Auction 



active bid becomes matched in the first period (if any) when it trades in the single-period 
matching rule. An active bid is marked as priced-out in the first period in which it loses 
but is not in the strong no-trade set. As soon as a bid is no longer active, it enters the 
history, , and the information about its bid price can be used in defining matching rules 
for future periods. 

Let denote the set of bids that will expire in the current period. A well-defined 
matching rule, when coupled with a valid strong no-trade construction, must provide Chain 
with the following information, given history H*, active bids 6* and active asks s*, and 
expiration set in period t: 

(a) for each bid or ask, whether it wins or loses 

(b) for each winning bid or ask, the payment collected (negative for an ask) 

(c) for each losing bid or ask, whether or not it satisfies the strong no-trade condition 

Note that the expiration set E*' is only used for the strong no-trade construction. This 
information is not made available to the matching rule. The following table summarizes the 
use of this information within Chain. Note that a winning bid cannot be in set SNT*: 
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SNT* 


Lose 


priced-out 


survive 


Win 


matched 


n/a 



We describe Chain by defining the events that occur for a bid upon its arrival into the 
market, and then in each period in which it remains active: 

Upon arrival: Consider all possible earlier arrival periods t' G [di — K,ai — 1] consistent 
with the reported type. There are no such periods to consider if the bid is maximally 
patient. If the bid would lose and not be in SNT* for any one of these arrival periods 
t', then it is not admitted. Otherwise, the bid would win in all periods t' for which 
i ^ SNT* , and define the admission price as: 

q{ai,di,e-i,uj) := max [pf,-oo], (3) 

t'e[di-K,ai-l],i^SNT* 

where p*' is the payment the agent would have made (negative for a seller) in arrival 
period t' (as determined by running the myopic matching rule in that period). When 
the agent would lose in all earlier arrival periods t' (and so i G SNT* for all t'), or 
the bid is maximally patient, then the admission price defaults to — oo and the bid is 
admitted. 

While active: Consider period i G [oj, di]. If the bid is selected to trade by the myopic 
matching rule, then mark it as matched and define final payment: 

= niax{q{di,di,e-i,u}),p\), (4) 

where p^ is the price (negative for a seller) determined by the myopic matching rule in 
the current period. If this is a buyer, then collect the payment but delay transferring 
the item until period di. If this is a seller, then collect the item but delay making 
the payment until the reported departure period. If the bid loses and is not in SNT*, 
then mark the bid as priced- out. 
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We illustrate Chain by instantiating it to various matching riilcs in the next section. 
In Section 5 we prove that Chain is strongly truthful and no-deficit when coupled with a 
well-defined matching rule and a valid strong no-trade construction. We will see that the 
delay in buyer delivery and seller payment ensures truthful revelation of a trader's departure 
information. For instance, in the absence of this delay, a buyer might be able to do better by 
over-reporting departure information, still trading early enough but now for a lower price. 

3.4 Comments 

We choose not to allow the single-period matching rules to use the reported arrival and 
departure associated with active bids and asks. This maintains a clean separation between 
non-temporal considerations (in the matching rules) and temporal considerations (in the 
wider framework of Chain). This is also for simplicity. The single-period matching rules 
can be allowed to depend on the reported arrival-departure interval, as long as the (single- 
period) rules are monotonic in tighter arrival-departure intervals, in the sense that an agent 
that wins for some O-i = (a,, di,Wi) continues to win and for an improved price if it instead 
reports {a[,d'^,Wi) with [a-,(i-] C [di,di]. However, whether or not trade is possible must 
be independent of the reported arrival-departure interval and similarly for strong no-trade. 
Determinations such as these would need to be made with respect to the most patient type 
{di - K, di,Wi) given report 6i = (a^, di, Wi). 

4. Practical Instantiations: Price-Based and Competition-Based Rules 

In this section we offer a number of instantiations of the Chain online DA framework. We 
present two different classes of well-defined matching rules: those that are price-based and 
compute simple price statistics based on the history which are then used for matching, and 
those that we refer to as competition-based and leverage the history but also consider direct 
competition between the active bids and asks in any period. In each case, we establish that 
the matching rules are well-defined and provide a valid strong no-trade construction. 

4.1 Price-Based Matching Rules 

Each one of these rules constructs a single price, p*, in period t based on the history i7* of 
earlier bids and asks that are traded or expired. For this purpose we define variations on a 
real valued statistic, ^{H^), that is used to define this price given the history. Generalizing 
the SimpleMatch procedure, as introduced in Section 3.1, the price is used to determine 
the trades in period t. We also provide a construction for strong no-trade in this context. 

The main concern in setting prices is that they may be too volatile, with price updates 
driving the admission price higher (via the max operator in the admission rule of Chain) 
and having the effect of pricing bids and asks out of the market. We describe various forms 
of smoothing and windowing, all designed to provide adaptivity while dampening short- 
term variations. In each case, the parameters (e.g. the smoothing factor, or the window 
size) can be determined empirically through off-line tuning. 

We experiment with five price variants: 

History-EWMA: Exponentially-weighted moving average. The bid history, H^, is used 
to define price in period t, computed as := A ^{H^) + (1 — A)p*~^, where A G (0, 1] is a 
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smoothing constant and ^(-ff*) is a statistic defined for bids and asks that enter the history 
in period t. Experimentally we find that the mean statistic, of the absolute 

values of bids and asks that enter the history performs well with A of 0.05 or lower for most 
scenarios that we test. For cases in which £,{H^) is not well-defined because of too few (or 
zero) new bids or asks, then we set p* := p*^^. 

History- median: Compute price p* from a statistic over a fixed-size window of the most 
recent history, p* := £,{H* ■, A) where A is the window-size, i.e. defining bids introduced 
to history in periods [t — A, . . . , t]. Experimentally, we find that the median statistic, 
^median ^ ^-j^ of the absolutc bid and ask values performs well for the scenarios we test, 
with the window size depending inversely with the volatility of agents' valuations. Typically, 
we observe optimal window sizes of 20 and 150, depending on volatility. For cases in which 
^(i?*,A) is not well-defined because of too few (or zero) new bids or asks, then we set 

History-clearing: Identical to the history-median rule except the statistic ^(i?*,A) is 
defined as [hm — Sm)/^ where bm and Sm are the lowest value pair of trades that would be 
executed in the efficient (value-maximizing) trade given all bids and asks to enter history 
in periods [t — A, . . . ,t]. Empirically, we find similar optimal window sizes for history- 
clearing as for history-median. 

History-McAfee: Define the statistic £,(H^, A) to represent the McAfee price, defined in 
Section 4.2, for the bids in had they all simultaneously arrived. 

Fixed price: This simple rule computes a single fixed price p* := p* for all trading periods, 
with the price optimized offline to max;imize the average-case efficiency of the dynamic DA 
given Chain and the associated single-period matching rule that leverages price p* as the 
candidate trading price. 

For each pricing variant, procedure MATCH (sec Figures 3-4) is used to determine which 
bids win (at price p*), which lose, and, of those that lose, which satisfy the strong no- 
trade predicate. The subroutine used to determine the current price is referred to as 
determineprice in Match. We provide as input to Match the set in addition to 
{H^,b^,s*) because Match also constructs the strong no-trade set, and is used exclu- 
sively for this purpose. 

The proof of the following lemma is technical and is postponed until the Appendix. 

Lemma 1 Procedure Match defines a valid strong no-trade construction. 

Theorem 1 Procedure Match defines a well-defined matching rule and a valid strong no- 
trade construction. 

Proof: No-deficit, feasibility, and individual-rationality are immediate by the construction 
of Match since bids and asks are added to matched in pairs, with the same payment, and 
only if the payment is less than or equal to their value. Truthfulness is also easy to see: the 
order with which a bid (or ask) is selected is independent of its bid price, and the price it 
faces, when selected, is independent of its bid. If the price is less than or equal to its bid, 
then whether or not it trades depends only on its order. The rest of the claim follows from 
Lemma 1. □ 
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function MATcn{H\b\s\E^) 

matched := 0, lose := 0, NT* := 0, SNT* := 
stop := false 

:= determineprice(-ff*) 
while -1 stop do 

i := 0, j := 0, checked^ := 0, checkeds := 
while ((checked^ C b*)k{i=0)) V ((checked^ C s*)&(j=0)) do 
if {i = 0)&(j = 0) then 

k := random(6*\ checked^ |J s*\ checked^) 
else if {i = 0) then 

k := random ( 6* \ checked^) 
else if (j = 0) then 

k := random(s*\ checked^) 
end if 

if {k G 6*) then 

checked^ := checked^ U {A;} 

if {bk ^ P^) then 
i := k 

end if 
else 

checked^ := checked^ U {fc} 
if {sk > —P^) then 

j := k 
end if 
end if 
end while 

if (i ^ 0)&(j ^ 0) then 
matched := matched |J 

lose := lose |J (checked^ \ {i}) \J (checked5 \ {j}) 
6* := 6*\ checkedB, s* := s*\ checkedg 
else 

stop := true 
end if 
end while 
end function 

Figure 3: The procedure used for single-period matching in applying Chain to the price- 
based rules. The algorithm continues in Figure 4. 
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function Match (continued) (i?*, 6*, s*,£?*) 

if {i / 0)k{j = 0) then > I 

lose := lose (J s*, NT* := 6* 

if {3k e 6* ■ {{bk > p^)k{dk = t))) V (VA; G s* • (4 = t)) then 
SNT* := 6* 

else > I-a 

SNT* := 5*\ checked^ 
end if 

else if [j ^ 0)&(i = 0) then > II 

lose := lose |J^*, NT* := s* 

if {3k G s* • {{sk > -p*)Hdk = t))) V (V/c € 6* • (4 = t)) then 

SNT* := s* 
else 

SNT* := s*\ checkeds 
end if 

else if {i = 0)&(j = 0) then > III 

NT* := 6* U s* 

if (Vfe G 6* • (4 = t)) V (V/c G s* • (4 = t)) then 

SNT* := 6* U s* 
end if 
end if 
end function 

Figure 4: Continuing procedure from Figure 3 for single-period matching in applying Chain 
to the price-based rules. 



Example 4 (i) Bid 6* = {8}, ask s* = {—6}, indexed {1,2} and price p* = 9. The outer 
while loop in Figure 3 terminates with j = 2 and i = in Case II. The bid is marked as 
a loser while NT* = {2}. // the bid will depart immediately, then SNT* = {2}, otherwise 
SNT* = 0. 

(a) Bid 6* = {8}, asks s* = {—6, —7}, indexed {1, 2, 3}, and price p* = 9. Suppose that 
ask 2 is selected before ask 3 in the outer while loop. Then the loop terminates with j = 2 
and i = in Case II and NT* = {2,3}. Suppose the bid and asks leave the market later 
than this period. Then SNT* = {3} because checkeds = {2}. 

(Hi) Bid 5* = {8} and ask s* = { — 10}, indexed {1, 2}, price p* = 9 and both the bid and 
the ask is patient. The outer while loop terminates with i = and j = in Case III so 
that NT* = {1,2}. However, SNT* = 0. 



4.2 Competition-Based Matching Rules 

Each one of these rules determines which bids match in the current period through price 
competition between the active bids. We present three variations: McAfee, Windowed- 
McAfee and Active-McAfee. The latter two rules are hybrid rules in that they leverage 
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history of past offers, in smoothing prices generated by the competition-based matching 
rules. 

McAfee: Use the static DA protocol due to McAfee as the matching rule. Let B denote 
the set of bids and S denote the set of asks. If min([i?|, \S\) < 2, then there is no trade. 
Otherwise, first insert two dummy bids with value {oo, 0} and two dummy asks with value 
{0, — oo} into the set of bids and asks. Let bo > bi > . . . > bm and sq > si > . . . > 
Sn ■ ■ ■ denote the bid and ask values with (6o, sq) denoting dummy pair (oo, 0) and {bm, s„) 
denoting dummy pair (0, — oo) and ties otherwise broken at random. Let m > index 
the last pair of bids and asks to clear in the efficient trade, such that bm + Sm ^ and 
6^+1 + Sm+i < 0. When m> 1, consider the following two cases: 

• (Case I) If price Pm+i = < 5^ and —pm+i < Sm then the first m bids and 
asks trade and payment Pm+i is collected from each winning buyer and made to each 
winning seller. 

• (Case II) Otherwise, the first m — 1 bids and asks trade and payment bm is collected 
from each winning buyer and payment — Srn is made to each winning seller. 

To define NT*, replace a bid that does not trade with a bid reporting a very large value 
and see whether this bid trades. To determine whether trade is possible for an ask that 
does not trade: replace the ask with an ask reporting value e > 0, some small e. Say 
that there is a quorum if and only if there are at least two bids and at least two asks, i.e. 
min(|6*|, |s*|) > 2. Define strong no-trade as follows: set SNT* := NT* = 6* U s* when 
there is no quorum and SNT* := otherwise. 

Lemma 2 For any bid bi in the McAfee matching rule, then for any other bid (or ask) j 
there is some bid bi that will make trade possible for bid (or ask) j when there is a quorum. 

Proof: Without loss of generality, suppose there are three bids and three asks. Label the 
bids (a, c, e) and the asks {b,d,f), both ordered from highest to lowest so that (a, 6) is the 
most competitive bid-ask pair. Proceed by case analysis on bids. The analysis is symmetric 
for asks and omitted. Let tp{i) G {0, 1} denote whether or not trade is possible for bid i, so 
that i G NT* <^ tp{i) = 0. For bid a: when b > -(a - d)/2 then tp{c) = tp{e) = 1 and this 
inequality can always be satisfied for a large enough a; when a > {c — d)/2 then tp{b) = 1 
and when a> (c — b)/2 then tp(d) = tp{f) = 1, and both of these inequalities are satisfied 
for a large enough a. For bid c: when b > —{c — d)/2 then tp{a) = 1 and when, in addition, 
c > a, then tp{e) = 1 and each one of these inequalities are satisfied for a large enough c; 
similarly when c > {a — d)/2 then tp{b) = 1 and when c > {a — b)/2 then tp{d) = tp{f) = 1. 
Analysis for bid e follows from that for bid c. □ 

Lemma 3 The construction for strong no-trade is valid and there is no valid strong no- 
trade construction that includes more than one losing bid or ask that will not depart in the 
current period for any period in which there is a quorum. 

Proof: To see that this is a valid construction, notice that strong no-trade condition (a) 
holds since any bid (or ask) is always in both NT* and SNT*. Similarly, condition (b) 
trivially holds (with the other bids and asks remaining in SNT* even if any bid is not 
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present in the market). To see that this definition is essentially maximal, consider now that 
min(|6*|, |s*|) > 2. For contradiction, suppose that two losing bids with departure 

after the current period are designated as strong no-trade. But, strong no-trade condition 
(b) fails because of Lemma 2 because either bid could have submitted an alternate bid price 
that would remove the other bid from NT* and thus necessarily also from SNT*. □ 
The construction offered for SNT* cannot be extended even to include one agent selected 
at random from the set i € NT* that will not depart immediately, in the case of a quorum. 
Such a construction would fail strong no-trade condition (b) when the set NT* contains 
more than one bid (or ask) that does not depart in the current period, because bid z's 
absence from the market would cause some other agent to be (randomly) selected as SNT*. 

Windowed-McAfee: This myopic matching rule is parameterized on window size A. 
Augment the active bids and asks with the bids and asks introduced to the history i7* in 
periods t' € {t — A + 1, . . . ,t}. Run McAfee with this augmented set of bids and asks and 
determine which of these bids and asks would trade. Denote this candidate set C. Some 
active agents identified as matching in C may not be able to trade in this period because 
C can also contain non-active agents. 

Let B' and S" denote, respectively, the active bids and active asks in set C. Windowed- 
McAfee then proceeds by picking a random subset of min(|i?'|, |S"|) bids and asks to trade. 
When \B'\ / then some bids and asks will not trade. 

Define strong no-trade for this matching rule as: 

(i) if there are no active asks but active bids, then SNT* := 6* 

(ii) if there are no active bids but active asks, then SNT* := s* 

(iii) if there are fewer than 2 asks or fewer than 2 bids in the augmented bid set, then 
SNT* := 6* U s*, 

and otherwise set SNT* := 0. In all cases it should be clear that SNT* C NT*. 

Lemma 4 The strong no-trade construction for windowed- McAfee is valid. 

Proof: That this is a valid SNT criteria in case (iii) follows immediately from the validity 
of the SNT criteria for the standard McAfee matching rule. Consider case (i). Case (ii) 
is symmetric and omitted. For strong no-trade condition (a), we see that all bids i G NT* 
and also i € SNT*, and whether or not they are designated strong no-trade is independent 
of their own bid price but simply because there are no active asks. Similarly, for strong 
no-trade condition (b), we see that all bids (and never any asks) are in SNT* whatever the 
bid price of any particular bid (and even whether or not it is present). □ 

Empirically, we find that the efficiency of Windowed-McAfee is sensitive to the size of 
i?*, but that frequently the best choice is a small window size that includes only the active 
bids. 

Active-McAfee: Active-McAfee augments the active bids and asks to include all un- 
expired but traded or priced-out offers. It proceeds as in Windowed-McAfee given this 
augmented bid set. 
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4.3 Extended Examples 

We next provide two stylized examples to demonstrate the matching performed by Chain 
using both a price-based and a competition-based matching rule. For both examples, we 
assume a maximal patience of K = 2. Moreover, while we describe when Chain determines 
that a bid or an ask trades, remember that a winning buyer is not allocated the good until 
its reported departure and a winning seller does not receive payment until its reported 
departure. 

Example 5 Consider Chain using an adaptive, price-based matching rule. The particular 

details of how prices are determined are not relevant. Assume that the prices in periods 1 
and 2 are = {8, 7} and the maximal patience is three periods. Now consider period 

3 and suppose that the order book is empty at the end of period 2 and that the bids and asks 
in Table 1 arrive in period 3. 
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Table 1: Bids and asks that arrive in period 3. Bids {bi, 62} match with asks {s2, 84} (as indicated 
with a *). Bid 63 is priced-out upon admission because Qb-j > ^63 (indicated with a strike- 
through). The admission price is qi and the payment made by an agent that trades is 
Pi. Column 'SNT?' indicates whether or not the bid or ask satisfies the strong no-trade 
predicate. Asks {51,55} survive into the next period because they are in SNT and have 
di > 3. 

Bids {61,52,64} and asks {si,..,S5} are admitted. Bid 63 is priced out because qi,^ = 
max(p^,p^, — 00) = max(8,7, — 00) = 8 > w^^ = 7 by Eq. (3). Note that 64 and S5 are 
admitted despite low bids (asks) because they have maximal patience and their admission 
prices are —00. Now, suppose that p""' := 6.5 is defined by the matching rule and consider 
applying Match to the admitted bids and asks. 

Suppose that the bids are randomly ordered as (64,62,61) and the asks as 
(s4, 52, Sl, S3, S5). Bid 64 is picked first but priced-out because = 6 < p^ = 6.5. Bid 
62 is tentatively accepted (w}^^ = 10 > = 6.5j and then ask S4 is accepted (ws^ = —5 > 
p^ = — 6.5J. Bid 62 is matched with ask S4, with payment max((/ft2,P^) = max(8, 6.5) = 8 
for 62 by Eq. (4) and payment max(gs^,p^) = max(— 00, —6.5) = —6.5 for S4. Bid 61 is then 
tentatively accepted (15 > 6.5) and then matched with ask S2, which is accepted because 
—3 > —6.5. The payments are max(7, 6.5) = 7 for 61 and max(— 00, —6.5) = —6.5 for S2. 
Ask S3 expires but asks si and S5 survive and are marked i G SNT in this period because 
they were never offered the chance to match with any bid. These asks will be active in period 

4- 

Note the role of the admission price in truthfulness. Had bid 61 delayed arrival until 
period 4, its admission price would be max(p^,p^, —00) = max(7, 6.5) = 7 and its payment 
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in period 4 (if it matches) at least 7. Similarly, had ask S4 delayed arrival, then its admission 
price would be max(— 7, —6.5, —00) = —6.5 and the maximal payment it can receive in period 
4 is 6.5. 

Example 6 Consider Chain using the McAfee-based matching rule with K = 3 and with 
the same bids and asks arriving in period 3. Suppose that the prices in periods 1 and 2 that 
would have been faced by a buyer are {p^,^^} = {8, 7} and {p\-,p1} = {—7, —6} for a seller. 
These prices are determined by inserting an additional bid ( with value 00) or an additional 
ask (with value 0) into the order books in each of periods 1 and 2. We will illustrate this 
for period 3. Consider now the bids and asks in period 3 in Table 2. 



i 


Wi 


di 


B 

d^-K 


Qi 


Pi 


SNT? 


i 


Wi 


di 


S 

di-K 


Qi 


Pi 


SNT? 


bi* 


15 


4 


2 


7 


7 


N 




-1 


4 


2 


-6 


-4 


N 


62* 


10 


3 


1 


8 


8 


N 


S2* 


-3 


5 


3 


— CXD 


-4 


N 




7 


3 


1 


8 


n/a 
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S3 


-4 


3 


1 


-6 


n/a 


N 




6 


5 


3 


—00 


n/a 


N 


S4 


-5 


4 


2 


-6 


n/a 


N 
















S5 


-10 


5 


3 


—00 


n/a 


N 



Table 2: Bids and asks that arrive in period 3. Bids {bi, 62} match with asks {si, S2} (as indicated 
with a *). Bid 63 is priced-out upon admission because qbs > Wb-^- The admission price is 
Qi and the payment made by an agent that trades is pi. Column 'SNT?' indicates whether 
or not the bid or ask satisfies the strong no-trade predicate. No asks or bids survive into 
the next period. 

As before bid 63 is not admitted. The myopic matching rule now runs the (static) McAfee 

auction rule on bids {61,62564} and asks {si, ..,3^}. Consider bids and asks in decreasing 
order of value, the last efficient trade is indexed m = 3 with wi,^ + = 6 — 4 > 0. But 
Pm+i = (0 — (— 5))/2 = 2.5 (inserting a dummy bid with value as described in Section 4-^)- 
Price —pm+i = —2.5 > S3 = —4 and this trade cannot be executed by McAfee. Instead, 
buyers {61,62} trade and face price = Wb4^ = 6 and sellers {51,^2} trade and face price 
pf^ = Wss = —4. Bids 64 and asks {53,54,55} are priced-out and do not survive into the 
next round. Ultimately, payment mdx{qb^,p^) = max(7, 6) = 7 is collected from buyer 61 
and payment max.{qi,2,Pm) — inax(8, 6) = 8 is collected from buyer 62. For sellers, payment 
max(— 6, —4) = —4 and max(— 00, —4) = —4 for si and S2 respectively. 

The prices pf and p^ that are used in Eq. (3) to define the admission price for bids and 
asks with arrivals in periods 4 o,nd 5 are determined as follows. For the buy-side price, we 
introduce an additional bid with bid-price 00. With this the bid values considered by McAfee 
would he (00, 15, 10, 6, 0) and the ask values would he (—1, —3, —4, —5, —10), where a dummy 
hid with value is included on the huy-side. The last efficient pair to trade is m = A with 
6 — 5 > and Pm+i = (0 ~ (~10))/2 = 5, which satisfies this bid-ask pair. Therefore the 
buy-side price, pf := 5. On the sell-side, we introduce an additional ask with ask-price 
so that the hid values considered by McAfee are (15,10,6,0) (again, with a dummy hid 
included) and the ask values are (0, — 1, — 3, — 4, — 5, — 10). This time m = 3 and the last 
efficient pair to trade is 6 — 3 > 0. Now Pm+i = (0 — (— 4))/2 = 2 and this price does not 
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satisfy S2, with —prn+\ > •§2 o,nd price pf^+i = S2 = —3 is adopted. Therefore the sell-side 
price, p^ := —3. 

Again, we can see that bidder 1 cannot improve its price by delaying its entry until period 
4- The admission price for the bidder would be max(p^,p^) = max(7,p^) > 7 and thus its 
payment in period 4, if it matches, will be at least 7.^ Similarly for ask si, which would face 
admission price max{j)g,j)g} = max{— 6, —4} = —4 and can receive a payment of at most 4 
in period 4- We leave it as an exercise for the reader to verify that p^ = —4 if ask si delays 
its arrival until period 4 (in comparison, p^ = —3 when ask si is truthful). 

Because the McAfee-based pricing scheme computes a price and clears the order book 
following every period in which there are at least two bids and two asks, the bid activity 
periods tend to be short in comparison to the adaptive, price-based rules where orders can 
be kept active longer when there is an asymmetry in the number of bids and asks in the 
market. In fact, one interesting artifact that occurs with adaptive, price-based matching 
rules is that the admission-price and SNT can perpetuate this kind of bid-ask asymmetry. 
Once the market has more asks than bids, SNT becomes likely for future asks, but not bids. 
Therefore, bids are much more likely than asks to be immediately priced out of the market 
by failing to meet the admission price constraint. 

5. Theoretical Analysis: Truthfulness, Uniqueness, and Justifying 
Bounded- Patience 

In this section we prove that Chain combined with a well-defined matching rule and a valid 
strong no-trade construction generates a truthful, no-deficit, feasible and individual-rational 
dynamic DA. In Section 5.2, we establish that uniqueness of Chain amongst dynamic DAs 
that are constructed from single-period matching rules as building blocks. In Section 5.3, 
we establish the importance of the existence of a maximal bound on bidder patience by 
presenting a simple environment in which no truthful, no-deficit DA can implement even a 
single trade despite the number of efficient trades can be increased without bound. 

5.1 The Chain Mechanism is Strongly Truthful 

It will be helpful to adopt a price-based interpretation of a valid single-period matching rule. 
Given rule M^,., define an agent-independent price, Zi{H^, A^\i,uj) G M where yl* = 6* U .5*, 
such that for all i, all bids 6*, all asks s*, all history if*, and all random events w G fi. We 
have: 

(Al) Wi - Zi{H\A* \ i,oj) > ^ ■Kjnr,i{H^,b\s^,oj) = 1, and Wi - Zi{H^,A* \ i,oj) < 
Trmr,i{H\b\s\uj) = 

(A2) payment Xrar,iiH^,b^,s^,<^) = Zi{H^,A^ \ ^,<^) if T^mr,iiH^ , = 1 and 

Xmr,i{H^-,b*,s*,oj) = Otherwise 

4. We can check that pi := 6 in this case. Suppose that bidder 1 were not present in period 3. Now consider 
introducing an additional bid with value oo so that the bids values arc {oo, 10, 6, 0} (with a dummy bid) 
with ask values {—1, —3, —4, —5, —10}. Then m = 3 and Pm+i = (0 — (— 5))/2 = 2.5, which does not 
support the trade between bid 64 and ask S3. Instead, = = 6 is adopted, and we would have 
pI := 6. Of course, this is exactly the price determined by McAfee for bid bi in period 3 when the bidder 
is truthful. 
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The interpretation is that there is an agent-independent price, Zi{H*,A* \ i,u}), that is 
at least Wi when the agent loses and no greater than lii otherwise. In particular, Zi{H^, A*' \ 
i,uj) = 00 when i G NT*. Although an agent's price is only explicit in a matching rule when 
the agent trades, it is well known that such a price exists for any truthful, single-parameter 
mechanism; e.g., see works by Archer and Tardos (2001) and Goldberg and Hartline (2003).^ 
Moving forward we adopt price Zi to characterize the matching rule used as a building block 
for Chain, and assume without loss of generality properties (Al) and (A2). 

Given this, we will now establish the truthfulness of Chain by appeal to a price-based 
characterization due to Hajiaghayi et al. (2005) for truthful, dynamic mechanisms. We state 
(without proof) a variant on the characterization result that holds for stochastic policies 
(tt, x) and sfron^^truthfulness. The theorem that we state is also specialized to our DA 
environment. We continue to adopt w G to capture the realization of stochastic events 
internal to the mechanism: 

Theorem 2 (Hajiaghayi et al., 2005) A dynamic DA M = (7r,a;), perhaps stochastic, is 
strongly truthful for misreports limited to no early-arrivals if and only if, for every agent i, 
all 9i, all 0-i, and all random events a; G J7, there exists a price pi{ai,di,9-i,co) such that: 

(Bl) the price is independent of agent i's reported value 

(B2) the price is monotonic-increasing in tighter [a^,d'^ C [oi,di] 

(B3) trade nilOijO-i) = 1 whenever pi{ai,di,9-i,Ld) < Wi and T:i{9i,0-i) = whenever 
Pi{ai, di, 9-i, co) > Wi, and the trade is performed for a buyer upon its departure period 
di. 

(B4) the agent's payment is Xi{9i,9-i) = pi{ai,di,9-i,uj) when Tri{9i,9-i) = 1, with 
Xi{9i,9-i) = otherwise, and the payment is made to a seller upon its departure, di. 

where random event uj is independent of the report of agent i in as much as it affects the 
price to agent i. 

Just as for the single-period, price-based characterization, the price ]3j(aj, dj, 6'_j, oj) need 
not always be explicit in Chain. Rather, the theorem states that given any truthful dynamic 
DA, such as Chain, there exists a well-defined price function with these properties of value- 
independence (Bl) and arrival-departure monotonicity (B2), and such that they define the 
trade (B3) and the payment (B4). 

To establish the triithfulness of CHAIN, we prove that it is well-defined with respect to 
the following price function: 

Pi{ai,di,9-i,u) = max{q{ai,di,9-i,uj),pi{ai,di,9-i,uj)), (5) 

where 

q{ai,di,9-i,u;) = max {zi{H* , A^ \i,u;), -00) (6) 

tGldi-K,a,-l],i0NT* 

5. A single-parameter mechanism is one in which the private information of an agent is hmited to one 
number. This fits the single-period matching problem because the arrival and departure information 

is discarded. Moreover, although there are both buyers and sellers, the problem is effectively single- 
parameter because no buyer can usefully pretend to be a seller and vtce versa. 



155 



Bredin, Parkes and Duong 



and 



p{ai,di,9-i,u) 



{ 



Zi{H**,A^* \i,uj) , if decision{i) = 1 
+00 , otherwise 



(7) 



where decision{i) = indicates that i € SNT* for all t G [ai,di] and decision{i) = 1 
otherwise, and where t* G [di,di] is the first period in which i ^ SNT*. We refer to this 
as the decision period. Term q{ai,di,6^i,uj) denotes the admission price, and is defined on 
periods t before the agent arrives for which i ^ SNT* had it arrived in that period. Note 
carefully that the rules of Chain are implicit in defining this price function. For instance, 
whether or not i G SNT* in some period t depends, for example, on the other bids that 
remain active in that period. 

We now establish conditions (B1)-(B4). The proofs of the technical lemmas are deferred 
until the Appendix. The following lemma is helpful and gets to the heart of the strong no- 
trade concept. 

Lemma 5 The set of active agents ( other than i) in period t in Chain is independent of 
i 's report while agent i remains active, and would be unchanged if i 's arrival is later than 
period t. 

The following result establishes properties (Bl) and (B2). 

Lemma 6 The price constructed from admission price q and post-arrival price p is value- 
independent and m,onotonic-increasing when the matching rule in Chain is well-defined, 
the strong no-trade construction is valid, and agent patience is bounded by K. 

Having established properties (Bl) and (B2) for price function pj(aj, dj, w), we just 
need to establish (B3) and (B4) to show truthfulness. The timing aspect of (B3) and (B4), 
which requires that the buyer receives an item and the seller receives its payment upon 
reported departure, is already clear from the definition of Chain. 

Theorem 3 The online DA Chain is strongly truthful, no-deficit, feasible and individual- 
rational when the matching rule is well-defined, the strong no-trade construction is valid, 
and agent patience is bounded by K. 

Proof: Properties (Bl) and (B2) follow from Lemma 6. The timing aspects of (B3) and 
(B4) are immediate. To complete the proof, we first consider (B3). li q > Wi, then agent 
i is priced out at admission by Chain because this reflects that Zi{H^,A^ \i,^) > Wi in 
some t & [di — K, dj — 1] with i ^ SNT* , and thus the bid would lose if it arrived in that 
period (either because it could trade, but for a payment greater than its reported value, or 
because i G NT*). Also, if there is no decision period, then p = 00, which is consistent with 
Chain, because there is no bid price at which a bid will trade when i G SNT* for all periods 
i G [di, di\. Suppose now that there is a decision period t* and q < Wi. li p > Wi, then there 
should be no trade. This is the case in Chain, because the price Zi{H^*,A^* \ i, uj) in t* is 
greater than Wi and thus the agent is priced-out. p < Wi then the bid should trade and 
indeed it does, again because the price Zi in that period satisfies (Al) and (A2) with respect 
to the matching rule. Turning to (B4), it is immediate that the payments collected in Chain 
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are equal to price pi{ai,di,6-i,uj), because if bid i trades then pi{ai,di,9-i,u}) < Wi and 
thus q < Wi and p <Wi. The admission price q{di,di,9^i,Ljj) = q(di,di,6^i,uj) when q < Wi 
because price Zi is well-defined by properties (Al) and (A2). Similarly, the payment p** 
defined by the matching rule in Chain in the decision period is equal to p. 

That Chain is individual-rational and feasible follows from inspection. Chain is no- 
deficit because the payment collected from every agent (whether a buyer or a seller) is at 
least that defined by a valid matching rule in the decision period t* (it can be higher when 
the admission price is higher than this matching price), the matching rules are themselves 
no-deficit, and because the auctioneer delays making a payment to a seller until its reported 
departure but collects payment from a buyer immediately upon a match. □ 

We remark that information can be reported to bidders that arc not currently partici- 
pating in the market, for instance to assist in their valuation process. If this information 
is delayed by at least the maximal patience of a bidder, so that the bid of a current bidder 
cannot influence the other bids and asks that it faces, then this is without any strategic 
consequences. Of course, without this constraint, or with bidders that participate in the 
market multiple times, the effect of such feedback would require careful analysis and bring 
us outside of the private values framework. 

5.2 Chain is Unique amongst Dynamic DAs that are constructed from Myopic 
Matching Rules 

In what follows, we establish that Chain is unique amongst all truthful, dynamic DAs that 
adopt well-defined, myopic matching rules as simple building blocks. For this, we define 
the class of canonical, dynamic DAs, which take a well-defined single period matching rule 
coupled with a valid strong no-trade construction, and satisfy the following requirements: 

(i) agents are active until they are matched or priced-out, 

(ii) agents participate in the single-period matching rule while active 

(iii) agents are matched if and only if they trade in the single-period matching rule. 

We think that these restrictions capture the essence of what it means to construct a 
dynamic DA from single-period matching rules. Notice that a number of design elements are 
left undefined, including the payment collected from matched bids, when to mark an active 
bid as priced-out, what rule to use upon admission, and how to use the strong no-trade 
information within the dynamic DA. In establishing a uniqueness result, we leverage the 
necessary and sufficient price-based characterization in Theorem 2, and exactly determine 
the price function pi{ai,di,6-i,oo) to that defined in Eq. (4) and associated with Chain. 
The proofs for the two technical lemmas are deferred until the Appendix. 

Lemma 7 A strongly truthful, canonical dynamic DA must define price pi{di,di,9^i,uj) > 
Zi{H^*,A^* \ i,Lo) where t* is the decision period for bid i (if it exists). Moreover, the bid 
must be priced- out in period t* if it is not matched. 

Lemma 8 A strongly truthful, canonical and individual-rational dynamic DA must define 
price pi{ai,di,0-i,u) > q{ai,di,0-i,oj), and a bid with wi < q{di,di,0-i,u) must be priced- 
out upon admission. 
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Theorem 4 The dynamic DA algorithm Chain uniquely defines a strongly truthful, 
individual-rational auction among canonical dynamic DAs that only designate bids as priced- 
out when necessary. 

Proof: If there is no decision period, then we must have Pi{ai, di, 9-i, oj) = oo, by canonical 
(iii) coupled with (B3). Combining this with Lemmas 7 and 8, we have Pi(ai,di,6-i,io) > 
m.SLx{q{di,di,9-i,u}),p{ai,di,6-i,Lo)). We have also established that a bid must be priced- 
out if its bid value is less than the admission price, or it fails to match in its decision 
period. Left to show is that the price is exactly as in Chain, and that a bid is admitted 
when its value Wi > q{ai,di,6-i,uj) and retained as active when it is in the strong no- 
trade set. The last two control aspects are determined once we choose a rule that "only 
designates bids as priced-out when necessary." We prefer to allow a bid to remain active 
when this does not compromise truthfulness or individual-rationality. Finally, suppose 
for contradiction that p' = pi{ai,di,6^i,uj) > max(q{ai,di,6^i,uj),p(di,di,9-i,Ljj)). Then 
an agent with iaiax{q(ai,di,9-i,uj),p{di,di,9-i,Ld)) < wi < p' would prefer to bid Wi = 
q{ai,di,0-i,co),p{ai,di,0-i,co)) — e and avoid winning - otherwise its payment would be 
greater than its value. □ 

5.3 Bounded Patience Is Required for Reasonable Efficiency 

Chain depends on a maximal bound on patience used to calculate the admission price faced 
by a bidder on entering the market with Eq. (3). To motivate this assumption about the 
existence of a maximal patience, we construct a simple environment in which the number of 
trades implemented by a truthful, no-deficit DA can be made an arbitrarily small fraction 
of the number of efficient trades with even a small number of bidders having potentially un- 
bounded patience. This illustrates that a bound on bidder patience is required for dynamic 
DAs with reasonable performance. 

In achieving this negative result, we impose the additional requirement of anonymity, 
This anonymity property is already satisfied by Chain, when coupled with matching rules 
that satisfy anonymity, as is the case with all the rules presented in Section 4. In defining 
anonymity, extend the earlier definition of a dynamic DA, M = {Tr,x), so that allocation 
policy vr = {vr*}*^"^ defines the probability 7r*(^-*) € [0, 1] that agent i trades in period t 
given reports 9-^. Payment, x = {x*}*^^, continues to define the payment xl{9-*) by agent 
i in period t, and is a random variable when the mechanism is stochastic. 

Definition 8 (anonymity) A dynamic DA, M = (tt, x) is anonymous if allocation policy 
TT = {tt*}*^-^ defines probability of trade 7r*(^-*) in each period t that is independent of 
identity i and invariant to a permutation of {9-^\i) and if the payment ■x\{9-*), contingent 
on trade by agent i, is independent of identity i and invariant to a permutation of {9-^ \ i). 

We now consider the following simple environment. Informally, there will be a random 
number of high- valued phases in which bids and asks have high value and there might be a 
single bidder with patience that exceeds that of the other bids and asks in the phase. These 
high-valued phases are then followed by some number, perhaps zero, of low-valued phases 
with bounded-patience bids and asks. Formally, there arc > 1 high-valued phases 
(a random variable, unknown to the auction), each of duration L > 1 periods, indexed 
/c € {0, 1, . . . , T/i — 1} and each with: 
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• AT or iV - 1 bids with type (1 + kL, {k + 1)L, vh), 

• or 1 bids with type (1 + kL,d,avH) for some mark-up parameter, a > 1 and some 
high-patience parameter, d £T, 

• N asks with type (1 + kL, {k + 1)L, —{vh — e)), 

followed by some number (perhaps zero) of low-valued phases, also of duration L, and 
indexed k € {T/j, . . . , oo}, with: 

• iV or - 1 bids with type (1 + kL, {k + 1)L, vl) 

• N asks with type (1 + kL, {k + 1)L, —{vl — e)), 

where N > 1, < vl < vh, and bid-spread parameter e > 0. Note that any phase can 
be the last phase, with no additional bids or asks arriving in the future. 

Definition 9 (reasonable DA) A dynamic DA is reasonable in this simple environment 
if there is some parameterization of new bids, N > 1, and periods-per-phase, L > 1, for 
which it will execute at least one trade between new bids and new asks in each phase, 
for any choice of high value vh, low value vl < vh, bid-spread e > 0, mark-up a> 1, high 
patience d. 

All of the dynamic DAs presented in Section 4 can be parameterized to make them 
reasonable for a suitably large > 1 and L >1, and without the possibility of a bid with 
an unbounded patience. 

Theorem 5 No strongly truthful, individual-rational, no-deficit, feasible, anonymous dy- 
namic DA can be reasonable when a bidder's patience can be unbounded. 

Proof: Fix any > 1, L > 1, and for the number of high- valued phases, T/j > 1, set the 
departure of a high-patience agent to d = {Th + 1)L. Keep vh > vl > 0, e > 0, and a > 1 
as variables to be set within the proof. Assume a dynamic DA is reasonable, so that it 
selects at least one new bid-ask pair to trade in each phase. Consider phase k = and with 
A^ — 1 agents of types {1,L,vh), N of type {1,L, —{vh — e)) and 1 agent of patient type, 
(1, (T/i -I- 1)L, avn)- If the patient bid deviates to (1, L, vh), then the bids are all identical, 
and with probability at least 1/N the bid would win by anonymity and reasonableness. 
Also, by anonymity, individual-rationality and no-deficit we have that the payment made 
by any winning bid is the same, and must be p' € [vh — e, vh] ■ (If the payment had been less 
than this, the DA would run at a deficit since the sellers require at least this much payment 
for individual-rationality.) Condition now on the case that the patient bid would win if it 
deviates and reports {1,L,vh)- Suppose the bidder is truthful, reports (1, {Th -\- l)L,avH) 
but does not trade in this phase. But, if phase A; = is the last phase with new bids and 
asks, then the bid will not be able to trade in the future and for strong-truthfulness the 
DA would need to make a payment of at least avn — = (o: — l)w/f in a later phase to 
prevent the bid having a useful deviation to (1, L, vh) and winning in phase k = 0. But, if: 

A^e < (a - 1)VH, (8) 
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then the DA cannot make this payment without failing no-deficit (because Ne is an upper- 
bound on the surplus the auctioneer could extract from bidders in this phase without 
violating individual-rationality). We will later pick values of a,e and vh, to satisfy Eq. (8). 
So, the bid must trade when it reports (1, {Th + l)L, avn), in the event that it would win 
with report {1,L,vh), as "insurance" against this being the last phase with new bids and 
asks. Moreover, it should trade for payment, p' € [vh — e, vh], to ensure an agent with true 
type {1,L,vh) cannot benefit by reporting (1, (Th + l)L,avH)- 

Now suppose that this was not the last phase with new bids and asks, and > I. 
Now consider what would happen if the patient bid in phase k = deviated and reported 
(1 + Tf^L, (Tfi + 1)L,vl). As before, this bid would win with probability at least 1/N by 
anonymity and reasonableness, but now with some payment p" G [vl — e,t'L]- Condition 
now on the case that the patient bid would win, both with a report of (1, L, vh) and with a 
report of (1 -t-T/jL, [T^ + 1)L,vl)- When truthful, it trades in phase k = with payment at 
least Vh — e. If it had reported (1 + ThL, (Th + 1)L, vl), it would trade in phase k = Th for 
payment at most vl- For strong truthfulness, the DA must make an additional payment to 
the patient agent of at least (vh — vl) — (vh — (vh — ^)) = vh — vl — ^- But, suppose that 
the high and low values are such that, 

(Th + l)A^e <VH-VL-e. (9) 

Making this payment in this case would violate no-deficit, because (Th+l)Ne is an upper- 
bound on the surplus the auctioneer can extract from bidders across all phases, including 
the current phase, without violating individual-rationality. But now we can fix any vl > 0, 
€ < Vl and choose vh > (Th + 1)A^€ + vl + e to satisfy Eq. (9) and a > (Ne/vn) + 1 
to satisfy Eq. (8). Thus, we have proved that no truthful dynamic DA can choose a bid- 
ask pair to trade in period A; = 0. The proof can be readily extended to show a similar 
problem with choosing a bid-ask pair in any period k < Th, by considering truthful type of 
(l + kL,(Th + l)L,avH). □ 

To drive home the negative result: notice that the number of efficient trades can be 
increased without limit by choosing an arbitrarily large Th, and that no truthful, dy- 
namic DA with these properties will be able to execute even a single trade in each of 
these {0, . . . , T/j — 1} periods. Moreover, we see that only a vanishingly small fraction of 
high-patience agents is required for this negative result. The proof only requires that at 
least one patient agent is possible in all of the high-valued phases. 

6. Experimental Analysis 

In this section, we evaluate in simulation each of the CHAiN-based DAs introduced in 
Section 4. We measure the allocative efficiency (total value from the trades), net efficiency 
(total value discounted for the revenue that flows to the auctioneer), and revenue to the 
auctioneer. All values are normalized by the total offline value of the optimal matching. 

For comparison we also implement several other matching schemes: the truthful, surplus- 
maximizing matching algorithm presented by Blum et al. (2006), an untruthful greedy 
matching algorithm using truthful bids as input to provide an upper-bound on performance, 
and an untruthful DA populated with simple adaptive agents that are modeled after the 
Zero-intelligence Plus trading algorithm that has been leveraged in the study of static 
DAs (Cliff k Bruten, 1998; Preist & van Tol, 1998). 
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6.1 Experimental Set-up 

Traders arrive to the market as a Poisson stream to exchange a single commodity at discrete 
moments. This is a standard model of arrival in dynamic systems, economic or otherwise. 
Each trader, equally likely to be a buyer or seller, arrives after the previous with an expo- 
nentially distributed delay, with probability density function (pdf): 

f{x) = Ae"^^, x>0, (10) 

where A > represents the arrival intensity in agents per second. Later we present results 
as the interarrival time, j, is varied between 0.05 and 1.5; i.e., as the arrival intensity is 
varied between 20 and |. A single trial continues until at least 5,000 buyers and 5,000 sellers 
have entered the market. In our experiments we vary the maximal patience K between 2 
and 10. For the distribution on an agent's activity period (or patience, di — ai), we consider 
both a uniform distribution with pdf: 

/(x) = -i, xG[0,K], (11) 

and a truncated exponential distribution with pdf: 

/(x) = ae-"^ xG[0,K], (12) 

where a = — ln(0.05)/ii' so that 95% of the underlying exponential distribution is less than 
the maximal patience. Both arrival time and activity duration are rounded to the nearest 
integral time period. A trader who arrives and departs during the same period is assumed 
to need an immediate trade and is active for only one period. 

Each trader's valuation represents a sample drawn at its arrival from a uniform distri- 
bution with spread 20% about the current mean valuation. (The value is positive for a bid 
and negative for an ask.) To simulate market volatility, we run experiments that vary the 
average valuation using Brownian motion, a common model for valuation volatility upon 
which many option pricing models are based (Copeland &; Weston, 1992). At every time 
period, the mean valuation randomly increases or decreases by a constant multiplier, e^''', 
where 7 is the approximate volatility and varied between and 0.15 in our experiments. 

We plot the mean efficiency of 100 runs for each experiment, with the same sets of bids 
and asks used across all double auctions. All parameters of an auction rule arc rcoptimized 
for each market environment; e.g., we can find the optimal fixed price and the optimal 
smoothing parameters offline given the ability to sample from the market model. 



6.2 Chain Implementation 

We implement Chain for the five price-based matching rules (history-clearing, history- 
median, history-McAfee, history-EWMA, and fixed-price) and the three competition-based 
matching rules (McAfee, active-McAfee, and windowed-McAfee). 

The price-based implementations keep a fixed-size set of the most recently expired, 
traded, or priced-out offers, H*. Offers priced-out by their admission prices are inserted 
into prior to computings*. The history-clearing metric computes a price to maximize the 
number of trades to agents represented by had they all been contemporary. The history- 
median metric chooses the price to be the median of the absolute valuation of the offers 
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in H*. The history-McAfee method computes the "McAfee price" for the scenario where 
all agents represented by are simultaneously present. The EWMA metric computes an 
exponentially-weighted average of bids in the order that they expire, trade, or price out. 
The simulations initialize the price to the average of the mean buy and sell valuations. If 
two bids expire during the same period, they are included in arbitrary order to the moving 
average. 

None of the metrics require more than one parameter, which is optimized offline with 
access to the model of the market environment. Parameter optimization proceeds by uni- 
formly sampling the parameter range, smoothing the result by averaging each result with 
its immediate neighbors. The optimization repeats twice more over a narrower range about 
the smoothed maximum, returning the parameter that maximizes (expected) allocative effi- 
ciency. None of the price-based methods appeared to be sensitive to small (<10%) changes 
in the size of H*. With most simulations, the window size was chosen to be about 150 of- 
fers. For EWMA, the smoothing factor was usually chosen to be around 0.05 or lower. The 
windowed-McAfee matching rule, however, was extremely sensitive to window size for simu- 
lations with volatile valuations, and the search process frequently converged to suboptimal 
local maxima. 

The admission price in the price-based methods is computed by first determining 
whether MATCH would check the value of the bid against bid price if the bid had arrived in 
some earlier period t'. Rather than simulate the entire Match procedure, it is sufficient to 
determine the probability pi of this event. This is determined by checking the construction 
of the strong no-trade sets in that earlier period. If SNT* contains non-departing buyers 
(sellers), then the probability that an additional seller (buyer) would be examined is 1 and 
Pi = 1. Otherwise the probability is equal to the ratio of the number of bids (asks) ex- 
amined not included in SNT* and one more than the total number of bids (asks) present. 
Finally, with probability pi the price the agent would have faced in period t' is defined as 
(— for sellers), and otherwise it is —oo. Here, p* is the history-dependent price defined 
in period t'. 

The competition-based matching rules price out all non-trading bids at the end of each 
period in which trade occurs (because of the definition of strong no-trade in that context). 
The admission prices are calculated by considering the price that a bid (ask) would have 
faced in some period t' before its reported arrival. In such a period, the price for a bid (ask) 
is determined by inserting an additional bid (ask) with valuation oo (0) and applying the 
competition-based matching rule to that (counterfactual) state. From this we determine 
whether the agent would win for its reported value, and if so what price it would face. 

6.3 Optimal Offline Matching 

We use a commercial integer program solver (CPLEX^) to compute the optimal offline 
solution, i.e. with complete knowledge about all offers received over time. In determining 
the offline solution we enforce the constraint that a trade can only be executed if the activity 
periods of both buyer, i, and seller, j, overlap, 

(oi < dj) A {ttj < di) (13) 

6. www.ilog.com 
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An integer-program formulation to maximize total value is: 

max Xij{wi + Wj) (14) 

{i,j)Goverlap 

s.t. < Xij < 1, Vj G ask 

i:{i,j)€overlap 

< Xij < 1, Vz G bid 

Xij e {0,1}, Vz, j, 

where G overlap is a bid-ask pair that could potentially trade because they have 

overlapping arrival and departure intervals satisfying Eq. (13). The decision variable Xij G 
{0, 1} indicates that bid i matches with ask j. This provides the optimal, offline allocative 
efficiency. 



6.4 Greedy Online Matching 

We implement a greedy matching algorithm that immediately matches offers that yield non- 
negative budget surplus. This is a non-truthful matching rule but provides an additional 
comparison point for the efficiency of the other matching schemes. During each time period, 
the greedy matching algorithm orders active bids and asks by their valuations, exactly as 
the McAfee mechanism does, and matches offers until pairs no longer generate positive 
surplus. The algorithm's performance allows us to infer the number of offers that the 
optimal matching defers before matching and the amount of surplus lost by the McAfee 
method due to trade reduction and due to the additional constraint of admission pricing. 

6.5 Worst- Case Optimal Matching 

Blum et al. (2006) derive a mechanism equivalent to our fixed-price matching mechanism, 
except that the price used is chosen from the cumulative distribution 

g(x) = -lnf ."""— ), (15) 
ra \{r-l)WminJ 

where r is the fixed point to the equation 

'r = ^\ -f , (16) 

\{r - l)Wmin J 

and Wmin ^ and Wmax ^ are the minimum and maximum absolute valuations of all 
traders in the market. For our simulations, we give the mechanism the exact knowledge of 
the minimum and maximum absolute valuations for each schedule. Blum et al. (2006) show 
that this method guarantees an expected competitive ratio of max(2, ln(';«„iax/^^'mm)) with 
respect to the optimal offline solution in an adversarial setting. We were interested to see 
how will this performed in practice in our simulations. 
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6.6 Strategic Open-outcry Matching: ZIP Agents 

To compare Chain with the existing hterature on continuous double auctions, we implement 

a DA that in every period sorts all active offers and matches the highest valued bids with 
the lowest valued asks so long as the match yields positive net surplus. The DA prices each 
trading pair at the mean of the pair's declared valuations. Since the trade price depends 
on a bidder's declaration, the market does not support truthful bidding strategies. We must 
therefore adopt a method to simulate the behavior of bidding agents within this simple open 
outcry market. 

For this, we randomly assign each bid to one of several "protocol agents" that each use 
a modified ZIP trading algorithm, as initially presented by Cliff and Bruten (1998) and 
improved upon by Preist and van Tol (1998). The ZIP algorithm is a common benchmark 
used to compare learned bidding behavior in a simple double-auction trading environment 
in which agents are present at once and adjust their bids in seeking a profitable trade. We 
adapt the ZIP algorithm for use in our dynamic environment. 

In our experiments we consider five of these protocol agents. New offers are assigned 
uniformly at random to a protocol agent, which remains persistent throughout the simula- 
tion. Each offer is associated with a patience category, k € {low, medium, high}, defined 
to evenly partition the range of possible offer patience. Each protocol agent, j, is defined 
with parameters {rj,l3j,jj) and maintains a profit margin, jjij, on each patience category k. 
Parameters {I3j,jj) control the adaptivity of the protocol agent in how it adjusts the target 
profit margin on an individual offer, with f3j ~ C/(0. 1,0.2) defining the offer-level learning 
rate and 7j ~ C/(0.2,0.8) defining the offer-level damping factor. Parameter rj G [0,1] is 
the learning rate adopted for updating the profit margins. 

The protocol agents are trained over 10 trials and their final performance is measured 
in the 11th trial. The learning rate decreases through the training session and depends on 
the initial learning rate and the adjustment rate r~^ . In period t G {1, . . . of trial 

A;G{l,...,r-|-l}, where T = 10 is the number of trials used for training and t^^^ is the 
number of periods in trial k, the learning rate is defined as: 

r,:=l-(^r° + (fc-l)r;+(^)%;j (17) 

where = (1 — rj )/{T -\- 1). We define := 0.7. The effect of this adjustment rule is that 
rj is initially 0.3, decreases during training, and trends to 0.0 as f — )■ t^^^ in trial k = 11. 

Within a given trial, upon assignment of a new offer i in patience category k, the protocol 
agent managing the offer initializes {iii[t), 6i{t)) := {fij, 0), where fii{t) represents the target 
profit margin for the offer and 6i{t) represents a profit-margin correction term. The target 
profit margin and the profit margin correction term are adjusted for offer i in subsequent 
periods while the bid remains active. 

The target profit margin is used to define a bid price for the offer in each period while 
it remains active: 

Wi{t):=Wi{l-\-fii(t)). (18) 



164 



Chain: An Online Double Auction 



At the end of a period in which an offer matches or simply expires, the profit margin 



/i^ for its patience category is updated as: 



(19) 



where the amount of adaptivity depends on the learning rate rj. Because the profit margin 
on an offer decays over its lifetime, this update adjusts towards a small profit margin if 

the offer expires or took many periods to trade, and a larger profit margin otherwise. The 
long-term learning of a protocol agent occurs through the profit margin assigned to each 
patience category. 

At the start of a period each protocol agent also computes target prices for bids and 
asks in each patience category. These are used to drive an adjustment in the target profit 
margin for each active bid and ask. Target prices T^{t) and Tg{t) are computed as: 



and. 



(1 + 77) max {wAt — 1)} + ^ , if > max {wAt 

i&B*'{t-l) ieS{t-l) 

(1 — rj) max {wi(t — 1)} — ^ , otherwise 

i&B''{t-l) 



1)}+ max {wi{t-l)} 

ieB'=(t-l) 



(20) 



(1 + 77) max itl'iit 
ieSf'it-i) 

(1 -r?) max {wi{t - I)} 

i65*(t-l) 



1)} + ^ , if > max {wAt - 1)} + max {wAt - 1)} 
iGB(t-l) ie5'=(t-l) 



^ , otherwise 



(21) 



where ^,r) i7 (0,0.05). Here, B{t — 1) and S{t — 1) denote the set of active bids and asks 

in the market in period t — I (defined before market clearing), and B^{t — 1) and S'*^(t — 1) 
denote the restrictions to patience category k. The target price on a bid in category k 
is set to something slightly greater than the most competitive bid in the previous round 
when that bid could not trade, and slightly less otherwise. Similarly for the target price on 
asks, where these prices are negative, so that increasing the target price makes an ask more 
competitive. 

Target prices are used to adjust the target profit margin at the start of each period on 
all active offers that arrived in some earlier period, where the influence of target prices is 
through the profit-margin correction term: 



Hi{t) :-- 



{Wi{t - 1) + di{t)) 



Wi 



(22) 



and the profit-margin correction term, 6i{t), is defined in terms of the target price T^{t) 
(equal to T^{t) if i is a bid and Tg{t) otherwise) as, 

6i{t) := jj6i{t - 1) + (1 - IjWrtit) - Wi{t - 1)), (23) 

where 7^ and fij are the offer-level learning rates and damping factor. The value wi and the 
"-1" term in Eq. (22) provide normalization. Eq. (23) is the Widrow-Hoff (Hassoun, 1995) 
rule, designed to minimize the least mean square error in the profit margin and adopted 
here to mimic earlier ZIP designs. 
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6.7 Experimental Results 

Our experimental results show that market conditions drive DA choice. We compare al- 
locative efficiency, revenue, and net efficiency. All results are averaged over 100 trials. 
In experiments we found only minimal qualitative differences between the use of the two 
patience distributions. The uniform patience distribution provides a slight increase in ef- 
ficiency over result using exponential patience, caused by a larger proportion of patient 
agents which relaxes somewhat the admission-price constraint in Eq. (3). For this reason 
we choose to report only results for the uniform patience distribution. 

While the performance of all methods are summarized in Table 3, we omit the perfor- 
mance of some markets from the plots to keep the presentation of results as clear as possible. 
We do not plot the price-based results for the median- or clearing-based prices because the 
performance was typically around that of the performance of Chain instantiated on the 
history-EWMA price. We do not plot the windowed-McAfee results because of inconsis- 
tent performance, and in most cases, upon manual inspection, it was optimal to choose the 
smallest possible window size, i.e. including only active bids and making it equivalent to 
active-McAfee. 

Our plots also leave out the performance of the Blum et al. (2006) worst-case optimal 
matching scheme because it was dominated by the fixed-price Chain instantiation and in 
many cases failed to yield any substantial surplus. Wc note here that the modeling assump- 
tion made by Blum et al. (2006) is quite different than that in our work: they worry about 
performance in an adversarial environment while we consider probabilistic environments. 
Our fixed-price Chain mechanism operates essentially identically to the surplus-maximizing 
scheme of Blum et al. (2006), except that CHAIN can also use additional statistical informa- 
tion to set the ideal price, rather than drawing the price from a distribution that is used to 
guarantee worst-case performance against an adversary. We defer the results for the Blum 
et al. (2006) scheme to Table 3. 

Figures 5-8 plot results from two sets of experiments, one for high-patience/low- volatility 
and one for low-patiencc/high- volatility, as we vary the inter-arrival time (and thus the 
arrival intensity), volatility and maximal patience. All plots are for allocative efficiency 
except Figure 6, where we consider net efficiency. Active-McAfee is included on Figure 5, 
but not on any other plots because it did not improve upon the McAfee performance in 
the other environments. To emphasize: the results for greedy provide an upper-bound on 
the best possible performance because this is a non-truthful algorithm, simulated here with 
truthful inputs. 

In Figure 5 (left) we see that from within the truthful DAs, the McAfee-based DA has 
the best efficiency for medium to low arrival intensities. There also is a general decrease 
in performance, relative to the optimal offline solution, as the arrival intensity falls. This 
trend, also observed with the greedy (non-truthful) DA, occurs because the Chain scheme 
is myopic in that it matches as soon as the static DA building block finds a match, while 
it is better to be less myopic when arrival intensity is low. The McAfee-based DAs are less 
sensitive to this than other methods because they can aggressively update prices using the 
active traders. The price-based DAs experience inefficiencies due to the lag in price updates 
because they use only expired, traded, and priced-out offers to calculate prices. 
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Figure 5: Allocative efficiency vs. inter-arrival time (1 / intensity) for several DAs. The left 
plot shows high-patience, low- volatility simulations, whereas the right plots results from 
low-patience, high-volatility runs. Both sets of experiments use uniform patience distri- 
butions. 
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Figure 6: Net efficiency vs. inter-arrival time (1 / intensity) for several DAs. The left plot 
shows high-patience, low-volatility simulations, whereas the right plots results from low- 
patience, high-volatility runs. Both sets of experiments use uniform patience distribu- 
tions. 



167 



Bredin, Parkes and Duong 



For very high arrival intensity we see Active-McAfee dominates McAfee. Active-McAfee 
smooths the price, which helps to mitigate the impact of fluctuations in cost on the admission 
price via Eq. (3) in return for less responsiveness. This is helpful in "well-behaved" markets 
with high arrival intensity and low volatility but was not helpful in most environments we 
studied, where the additional responsiveness provided by the (vanilla) McAfee scheme paid 
off. 

The ZIP market also has good performance in this high-patience/low- volatility environ- 
ment. The reason is simple: this is an easy environment for simple learning agents, and the 
agents quickly learn to be truthful. We emphasize that these ZIP market results should be 
treated with caution and are certainly optimistic. This is because the ZIP agents are not 
programmed to consider timing-based manipulations. The effect in this environment is that 
the ZIP market tends to operate as if a truthful market, but without the cost of imposing 
truthfulness explicitly via market-clearing rules. By comparison the Chain auctions are 
fully strategyproof, to both value and temporal manipulations. 

Compare now with Figure 5 (right), which is for low patience and high volatility. Now we 
see that McAfee dominates across the range of arrival intensities. Moreover, the performance 
of ZIP is now quite poor because the agents do not have enough time to adjust their bids 
(patience is low) and high volatility makes this a more difficult environment. With volatile 
valuations, the possibility of valuation swings leaves open the possibility of larger profits, 
luring agents to set wider profit margins, but only after the market changes. The ZIP agents 
also have fewer concurrent competitive offers to use in setting useful price targets during 
learning. As we might expect, high volatility also negatively impacts the efficiency of the 
fixed-price scheme. 

In Figure 6 we see that the net efficiency trends are qualitatively similar except that the 
competition-based DAs such as McAfee fare less well in comparison with the price-based 
DAs. The auctioneer accrues more revenue for competition-based matching rules such as 
McAfee because they often generate buy and sell prices with a spread. Together with 
the competition-based schemes being intrinsically more dynamic, this drives an increased 
price spread in Chain via the admission price constraints. In Figure 6 (left) we see that 
the fixed-price scheme performs well for high arrival intensity while EWMA dominates for 
intermediate arrival intensities. The McAfee scheme is still dominant for lower patience and 
higher volatility (Figure 6, right). 

To reinforce these observations, in Table 3 we present the the net efficiency, allocative 
efficiency and (normalized) revenue across all arrival intensities (i.e. inter-arrival time from 
0.05 to 1.5) and for both low and high volatility trials. All five price-based methods, all 
three competition-based methods, and all three comparison methods are included. We 
highlight the best performing competition-based method, price-based method, as well as 
the performance of the ZIP market (skipping over the non-truthful, greedy algorithm) . We 
omit information about the mean standard error for each measurement because in no case 
did this error exceed a tenth of a percent of the mean optimal surplus. From within the 
truthful DAs, we see that the McAfee-based scheme dominates overall for both allocative 
and net efficiency and both low and high volatility, although EWMA competes with McAfee 
for net efficiency in low volatility markets. Notice also the good performance of the ZIP- 
based market (with the aforementioned caveat about the restricted strategy space) at low 
volatilities. 
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scenario 



low- volt /hi^ 
net alloc 



;li-pat 
rev 



high- 
net 



•volt/low-pat 
alloc 



rev 



mcafee 

active-mcafee 
windowcd-mcafee 




0.40 

0.32 
0.21 



0.45 

0.37 

0.23 



0.05 

0.05 
0.03 



history-clearing 

history-ewma 

history-fixed 

history-mcafee 

history- median 

blum et al. 

greedy 

zip 



0.10 
0.86 
0.82 



0.10 
0.86 
0.82 



0.00 
0.00 
0.00 



0.17 
0.19 

0.04 
0.15 
0.17 
0.02 
0.87 
0.23 



0.17 
0.22 

0.04 
0.16 
0.18 
0.02 
0.87 
0.23 



0.01 

0.03 

0.00 

0.01 

0.01 

0.00 

0.00 

0.00 



Table 3: Net cfSciency, allocativc efficiency and auctioneer revenue (all normalized by the optimal 
value from trade), averaged across all arrival intensities (0.05-1.5) and for low and high 
value volatility. The best performing competition-based, price-based and 'other' (ignoring 
greedy, which is not truthful) results are highlighted. 
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Figure 7: Allocative efHciency vs. volatility for several DAs for a fairly low arrival intensity. The 
left plot is for large maximal patience and the right plot is for small maximal patience. 
Both sets of experiments use uniform patience distributions. 



Figure 7 plots allocative efficiency versus volatility for high patience (left) and low 
patience (right) and for fairly low arrival intensity. Higher volatility hurts all methods - 
especially the ZIP agents, which struggle to learn appropriate profit and price targets, 
probably due to few opportunities to update prices for every individual offer. The McAfee 
scheme fairs very well, showing good robustness for both large patience and small patience 
environments. The fixed-price scheme has the best performance when there is zero volatility 
but its efficiency falls off extremely quickly as volatility increases. 
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Figure 8: Allocative efficiency vs. maximal patience for several DAs and fairly low arrival intensity. 

The left plot is for low volatility and the right plot is for high volatility. Both sets of 
experiments use uniform patience distributions. 

We also consider the effect of varying maximal patience. This is shown in Figure 8, 
with low volatility (left) and high volatility (right). Again, the McAfee scheme is the best 
of the truthful DAs based on Chain. We also sec that the performance of ZIP improves 
as patience increases due to more opportunities for learning. Perversely, a larger patience 
can negatively affect the truthful DAs. In part this is simply because the performance of 
greedy online schemes, relative to the offline optimal, decreases as patience increases and 
the offline optimal matching is able to draw more benefit from its lack of myopia. 

We also suspected another culprit, however. The possibility of the presence of patient 
agents requires the truthful DAs to include additional terms in the max operator in Eq. (3) 
to prevent manipulations, leading to higher admission prices and less admitted offers. To 
better understand this effect we experimented with delayed market clearing in the McAfee 
scheme, where the market matches agents only every r-th period (the "clearing duration"). 
The idea is to make a tradeoff between using fewer admission prices and the possibility that 
we will miss the opportunity to match some impatient offers. 

Figure 9 shows allocative efficiency when the matching mechanism clears less frequently 
and for different maximal patience, K. Figure 9 (left) is for low volatility. There we 
see that the best clearing duration is roughly 1, 2, 3 and 4 for maximal patience of K e 
{4, 6, 8, 10} and that by optimizing the clearing duration the performance of McAfee remains 
approximately constant as maximal patience increases. In Figure 9 (right) we consider the 
effect in a high volatility environment, with these results averaged over 500 trials because the 
performance of the DA has higher variance. We see a qualitatively similar trend, although 
higher maximal patience now hurts overall and cannot be fully compensated for by tuning 
the clearing duration. 

7. Related Work 

Static two-sided market problems have been widely studied (Myerson &: Sattcrthwaitc, 1983; 
Chatterjee & Samuelson, 1987; Satterthwaite &; Williams, 1989; Yoon, 2001; Deshmukh 
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Figure 9: Allocative efficiency vs. clearing duration in the McAfee-based Chain auction for fairly 
low arrival intensity and as maximal patience is varied from 4 to 10. The left plot is 
for low volatility and the right plot is for high volatility. Both sets of experiments use 
uniform patience distributions. 



et al., 2002). In a classic result, Myerson and Satterthwaite proved that it is impossible to 
achieve efHciency with voluntary participation and without running a deficit, even relaxing 
dominant-strategy equilibrium to a Bayesian-Nash equilibrium. Some truthful DAs are 
known for static problems (McAfee, 1992; Huang et al., 2002; Babaioff k Nisan, 2004; 
BabaiofF k. Walsh, 2005). For instance, McAfee introduced a DA that sometimes forfeits 
trade in return for achieving truthfulness. McAfee's auction achieves asymptotic efficiency 
as the number of buyers and sellers increases. Huang et al. extend McAfee's mechanism 
to handle agents exchanging multiple units of a single commodity. BabaiofF and colleagues 
have considered extensions of this work to supply-chain and spatially distributed markets. 

Our problem is also similar to a traditional continuous double auction (CDA), where 
buyers and sellers may at any time submit offers to a market that pairs an offer as soon 
as a matching offer is submitted. Early work considered market efficiency of CDAs with 
human experiments in labs (Smith, 1962), while recent work investigates the use of software 
agents to execute trades (Rust et al., 1994; Cliff & Bruten, 1998; Gjerstad & Dickhaut, 
1998; Tesauro & Bredin, 2002). While these markets have no dominant strategy equilibria, 
populations of software trading agents can learn to extract virtually all available surplus, 
and even simple automated trading strategies outperform human traders (Das et al., 2001). 
However, these studies of CDAs assume that all traders share a known deadline by which 
trades must be executed. This is quite different from our setting, in which we have dynamic 
arrival and departure. 

Truthful one-sided online auctions, in which agents arrive and depart across time, have 
received some recent attention (Lavi & Nisan, 2000; Hajiaghayi et al., 2004, 2005; Porter, 
2004; Lavi & Nisan, 2005). We adopt and extend the monotonicity-based truthful charac- 
terization in the work of Hajiaghayi et al. (2005) in developing our framework for truthful 
DAs. Our model of DAs must also address some of the same constraints on timing that 
occur in Porter, Hajiaghayi, and Lavi and Nisan's work. In these previous works, the items 
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were reusable or expiring and could only be allocated in particular periods. In our work we 
provide limited allowance to the match-maker, allowing it to hold onto a seller's item until 
a matched buyer is ready to depart (perhaps after the seller has departed). 

The closest work in the literature is due to Blum et al. (2006) , who present online match- 
ing algorithms for the same dynamic DA model. The main focus in their paper is on the 
design of matching algorithms with good worst-case performance in an adversarial setting, 
i.e. within the framework of competitive analysis. Issues related to incentive compatibility 
receive less attention. One way in which their work is more general is that they also study 
goals of profit and maximizing the number of trades, in addition to the goal of maximizing 
social welfare that we consider in our work. However, the only algorithmic result that they 
present that is truthful in our model (where agents can misreport arrival and departure) is 
for the goal of social welfare. The DA that they describe is an instance of Chain in which 
a fixed price is drawn from a distribution at the start of time, and used as the matching 
price in every period. Perhaps unsurprisingly, given their worst-case approach, we observe 
that their auction performs significantly worse than Chain defined for a fixed price that is 
picked to optimize welfare given distributional information about the domain. 

8. Conclusions 

We presented a general framework to construct algorithms to match buyers with sellers in 
online markets where both valuation and activity-period information arc private to agents. 
These algorithms guarantee truthful dominant strategies by first imposing a minimum ad- 
mission price for each offer and then pricing and pairing the offer at the first opportunity. 
At the heart of the Chain framework lies a pricing algorithm that must for each offer either 
determine a price independent of any information describing the offer or choose to discard 
the offer. The pricing algorithm should be chosen to match market conditions. We present 
several examples of suitable pricing schemes, including fixed-price, moving-average, and 
McAfee-based schemes. 

More often than not, we find that the competition-based scheme that employs a McAfee- 
based rule to truthfully price the market delivers the best allocative efficiency. For excep- 
tionally low volatility and high arrival intensity, we find that adaptive price-based schemes 
such as an exponentially- weighted moving average (EWMA) and even fixed price schemes 
perform well. We see qualitatively similar results for net efficiency, where the revenue that 
accrues to the auctioneer is discounted, albeit that the price-based rules such as EWMA 
have improved performance because they have no price spread. The observations are rooted 
in simulations comparing the market efficiency under each mechanism with the optimal of- 
fline solution. 

Additionally, we compare the efficiency of our truthful markets with a fixed-price worst- 
case optimal scheme presented by Blum et al. (2006), a market of strategic agents using 
a variant on the ZIP price update algorithm developed by Cliff and Bruten (1998) for 
continuous double auctions, and a non-truthful, greedy matching algorithm to provide an 
upper-bound on performance. The best of our schemes yield around 33% net efficiency 
in low volatility, high patience environments and 40% net efficiency in high volatility, low 
patience environments, while the greedy bound suggests that as much as 86% efficiency 
is possible with non-strategic agents. We note that the Blum et al. scheme, designed for 
adversarial settings, fairs poorly in our simulations (< 10%). 
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One can axgue, we think convincingly, that truthfulness brings benefits in itself in that it 
avoids the waste of costly counterspeculation and promotes fairness in markets (Sandholm, 
2000; Abdulkadiroglu et al., 2006). On the other hand, it is certainly of interest that 
the gap between the efficiency of greedy matching with non-truthful matching and that of 
our truthful auctions is so large. Here, we observe that the ZIP-populated (non-truthful) 
markets achieve around 82% efficiency in low volatility environments but collapse to around 
23% efficiency in high volatility environments. Based on this, one might conjecture that 
designing for truthfulness is especially important in badly behaved, highly volatile ("thin") 
environments but less important in well behaved, less volatile ("thick") environments. 

Formalizing this tradeoff between providing absolute truthfulness and approximate 
truthfulness, and while considering the nature of the environment, is an interesting di- 
rection for future work (see paper by Parkes et al., 2001). Given that reporting of market 
statistics can be incorporated within our framework (see Section 5.1), and given that mar- 
kets also play a role in information aggregation and value discovery, future research should 
also consider this additional aspect of market design. Perhaps there is an interesting tradeoff 
between efficiency, truthful value revelation, and the process of information aggregation. 

While the general Chain framework achieves good efficiency, further tuning seems pos- 
sible. One direction is to adopt a meto-pricing scheme that chooses, or blends, prices from 
competing algorithms. Another direction is to consider richer temporal models; e.g., the 
value of goods to agents might decay or grow over time to better account for the time 
value of assets. A richer temporal model might also consider the possibility of agents or the 
match-maker taking short positions (including short-term cash deficits) to increase trade. 
It is also interesting to extend our work to markets with non-identical goods and more 
complex valuation models such as bundle trades (Chu & Shen, 2007; Babaioff &: Walsh, 
2005; Gonen et al., 2007), and to dynamic matching problems without prices, such as an 
online variation of the classic "marriage" problem (Gusfield &; Irving, 1989). 
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Appendix: Proofs 

Lemma 1 Procedure Match defines a valid strong no-trade construction. 
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Proof: In all cases, SNT* C NT*. The set NT* is correctly constructed: equal to all 
remaining bids 6* when (j = 0) in Case I, all remaining bids s* when (i = 0) in Case II, 
and all remaining bids and asks otherwise. In each case, no bid (or ask) in NT* could have 
traded at any price because there was no available bid or ask on the opposite of the market 
given its order. 

In verifying strong no-trade (SNT) conditions (a) and (b), we proceed by case analysis. 
Case I. {i / 0) and (j = 0). NT* := 6*. 

(I-l) VA; G s* • (4 = t) and SNT* := 6*. For SNT-a, consider / G NT* with di > t. If 
I deviates and i changes but we remain in Case I then NT* is unchanged and still 
contains If / deviates and i — )■ then, we go to Case III and SNT* := 6* U s* and still 
contains I. For SNT-b, consider I G SNT* that deviates with di > t. Again, either we 
remain in this case and SNT* is unchanged or i — > and we go to Case III. But now 
SNT* still contains all 6* and is therefore unchanged for all agents with dk > t. 

(1-2) Buyer G 6* with 4 = t and bk > p* and SNT* := 6*. For SNT-a, consider I G NT* 
with di > t. We remain in this case for any deviation by buyer I because buyer k 
will ensure i 0, and so SNT* remains unchanged and still contains I. For SNT-b, if 
I G SNT* with di> t deviates we again remain in this case and SNT* is unchanged. 

(1-3) Some seller with d^ > t and no buyer with dk' = t willing to accept the price. 
SNT* := 6* \ checkedfi. For SNT-a, consider I G NT* with di > t. First, suppose 
I G checked^ and i ^ I. If Z deviates but still has di > t, then even if i := I then we 
remain in this case and I does not enter SNT*. Second, suppose I G checked^ and 
{i = I). If I deviates but still has di > t, then even if (i = 0) and (j = 0), wc go 
to Case III and SNT* = and Z does not enter SNT*. Third, suppose I ^ checked^ 
and di > t. Deviating while di > t has no effect and we remain in this case and / 
remains m SNT*. For SNT-b, consider I G SNT* with di > t, i.e. with I ^ checked^. 
If I deviates but di > t, then this has no effect and we remain in this case and SNT* 
remains unchanged. 

Case II. {j / 0) and (i = 0). NT* := s*. Symmetric with Case I. 
Case in. {i = 0) and {j = 0). NT* := 6* U s*. 

(III-l) yk £ ■ (4 = t) but 3k' e s* • (4' > t) and SNT* := 6* U .s*. For SNT-a, consider 
/ G NT* with di > t. This must be an ask. If I deviates but we remain in this case, 
then I remains in SNT*. If j := I, then we go to Case II and SNT* := s* and I remains 
in SNT*. For SNT-b, consider I G SNT* with di > t, which must be an ask. If I 
deviates but we remain in this case, SNT* is unchanged. If / deviates and j := I, then 
we go to Case II, SNT* := s*, and buyers 6* arc removed from SNT*. But this is OK 
because all buyers depart in period t anyway. 

(III-2) VA; G s*cdot{dk = t) but 3k' G 6* • (4' > t) and SNT* := 6* U s*. Symmetric to Case 
III-l. 

(III-3) \fk £ b^ ■ (4 = t) and VA; G s^cdot{dk = t). SNT* := 6* U s*. SNT-a and SNT-b are 
trivially met because no bids or asks have departure past the current period. 
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(III-4) 3k G 6* • (4 > t) and 3k' G s* • (4' > t) and SNT* := 0. For SNT-a, consider I G NT* 
with di > t. Assume that I is a bid. If / deviates and di > t and i = then we remain 
in this case and I is not in SNT*. If I deviates and di > t but i := I, then we go to 
Case I and we are necessarily in Sub-case (I-a) because di > t and there can be no 
other bid wilhng to accept the price (else i / in the first place). Thus, we would 
have SNT* := &* \ checked^ and I would not be in SNT*. For SNT-b, this is trivially 
satisfied because there are no agents I G SNT*. 

□ 

Lemma 5 The set of active agents ( other than i) in period t in Chain is independent of 
i 's report while agent i remains active, and would he unchanged if i 's arrival is later than 
period t. 

Proof: Fix some arrival period Oj. Show for any aj > aj, the set of active agents in period 
t > Oj while i is active is the same as A* without agent i's arrival until some > t. Proceed 
by induction on the number of periods that t is after a^. For period t = ai this is trivial. 
Now consider some period Oj + r, for some r > 1 and assume the inductive hypothesis for 
+ r — 1. Since i is still active then, i G SNT* for t' = Oj + r — 1, and therefore the other 
agents in SNT* that survive into this period are independent of agent i's report by strong 
no-trade condition (b). This completes the proof. □ 

Lemma 6 The price constructed from admission price q and post- arrival price p is value- 
independent and monotonic-increasing when the matching rule in Chain is well-defined, 
the strong no-trade construction is valid, and agent patience is bounded by K. 

Proof: First fix di,di and 9-i. To show value-independence (Bl), first note that q is 
value-independent, since whether or not i G SNT* in some pre-arrival period t is value- 
independent by strong no-trade condition (a) and price Zi{H^,A^ \ i^uj) in such a period 
is agent-independent by definition. Term p is also value-independent: the decision period 
t* to agent i, if any, is independent of Wi since the other agents that remain active are 
independent of agent i while it is active by Lemma 5, and whether or not i G SNT* is 
value-independent by strong no-trade (a); and the price in is value-independent when 
the set of other active agents arc valuc-indcpcndcnt. 

Now fix 9-i and show the price is monotonically-increasing in a tighter arrival-departure 
interval (B2). First note that q is monotonic-increasing in [a^, dj] C [oj, di] because an earlier 
di and later O j increases the domain t G [dj — K, ai — 1] on which q is defined. Fix some 
ai > ai. Argue the price increases with earlier d[ < di, for any di > Ui. To see this, note that 
either di < t* and so Pi(di, d[, 6-i, w) = oo for all d[< di, or di > t* and the price is constant 
until di < t* at which point it becomes oo. Fix some di > Oi. Argue the price increases with 
later a'^ > ai, where ai > di — K. First, while a- < t*, then p is unchanged by Lemma 5. 
The interesting case is when a'- > t*, especially when q{ai,di,9-i,Lo) < p(ai,di,9^i,io). 
By reporting a later arrival, the agent can delay its decision period and perhaps hope to 
achieve a lower price. But, note that in this case t* G [di — K,a'j^ — 1] since di — K < Oi and 
t* G [oi, a'- — 1] and so q{a,[, di, 9^i,u}) > p{H**,A** \ i, uj) because q now includes the price 
in period t* since i ^ SNT** in that pre-arrival period by Lemma 5. Overall, we see that 
although p may decrease, max(g,p) cannot decrease. □ 



175 



Bredin, Parkes and Duong 



Lemma 7 A strongly truthful, canonical dynamic DA must define price pi{ai,di,6-i,uj) > 
Zi{H^*,A^* \ i,uj) where t* is the decision period for bid i (if it exists). Moreover, the bid 
must be priced-out in period t* if it is not matched. 

Proof: (a) First, suppose Zi{H*'* , A''* \ i,uj) > wi but bid i is not priced-out and instead 
survives as an active bid into the next period. But with i ^ SNT**, the set of active bids in 
period t* +1 need not be independent of agent i's bid and the price Zi{H**^^ , A^*^^ \ i,u) 
need not be agent-independent. Yet, canonical rule (iii) requires that this price be used 
to determine whether or not the agent matches, and so the dynamic DA need not be 
truthful, (b) Now assume for contradiction that pi{ai,di,6-i,uj) < Zi{H^*.,A^* \ i,uj). First, 
if Zi{H^*,A^*\i, uj) < oo, then an agent with value Pi{ai, di, 9-i, oj) < Wi < Zi{H^*,A**\i, uj) 
will report Wi = Zi{H^*,A** \i,co) + e and trade now for a final payment less than its true 
value (whereas it would be priced-out if it reported its true value). If Zi{H^* ,A^* \i, uj) = oo, 
then Pi{ai, di, 6-i, u) < Zi{H** , A**\i, lo) implies that some bids will survive this period even 
though they are priced-out by the matching rule and not in the strong no-trade set. This 
compromises the truthfulness of the dynamic DA, as discussed in part (a). □ 

Lemma 8 A strongly truthful, canonical and individual-rational dynamic DA m,ust define 
price pi{di,di,0-i,uj) > q{di,di,6-i,Lo), and a bid with Wi < q{ai,di,6-i,uj) must be priced- 
out upon admission. 

Proof: Suppose di < ai -\- K so that [di — K,ai — 1] is non-empty. For di = ai + K — 1, 
when t = di — K is a decision period (and i ^ SNT*), we have 

Pi{ai,di,e-i,co) >pi{di-K,di,e^i,co) > Zi{H\ A^ \i,co), (24) 

where the first inequality is by monotonicity (B2) and the second follows from Lemma 7 

since di — K is a decision period, and would remain one with report = (dj — K,di,w'j^) 
by Lemma 5. This establishes Pi{di, di, 9-i, to) > q{di,di, 9-i, uj) for di = di -\- K — 1. When 
di = di + K — 2, then we need Eq. (24), and also when t = di — K + 1 is a decision period 
(and i ^ SNT*) we have, 

Pi{di,di,9-i,u) >pi{di-K-^l,di,9-i,uj) > Zi{H\A*\i,uj), (25) 

by the same reasoning as above. This generalizes to dj = -|- if — r for r G {2, . . . ,K} 
to establish pi{di,di,9-i,uj) > q{di,di,9-i,uj) for the general case. To see the bid must be 
priced-out when Wi < q{di, di, 9-i, uj), note that if it were to remain active it could match in 
the matching rule and by canonical (iii) need to trade, and thus fail individual-rationality 
since the payment collected would be more than the value. □ 
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Abstract 



In this paper we present and evaluate a general framework for the design of truthful 
auctions for matching agents in a dynamic, two-sided market. A single commodity, such 
' as a resource or a task, is bought and sold by multiple buyers and sellers that arrive 

, and depart over time. Our algorithm. Chain, provides the first framework that allows a 

' truthful dynamic double auction (DA) to be constructed from a truthful, single-period (i.e. 

O . static) double- auction rule. The pricing and matching method of the Chain construction 

is unique amongst dynamic-auction rules that adopt the same building block. We examine 
experimentally the allocative efficiency of Chain when instantiated on various single-period 
^ ' rules, including the canonical McAfee double-auction rule. For a baseline we also consider 

^ : non-truthful double auctions populated with "zero- intelligence plus" -style learning agents. 

CHAiN-based auctions perform well in comparison with other schemes, especially as arrival 
intensity falls and agent valuations become more volatile. 



1. Introduction 



Electronic markets are increasingly popular as a method to facilitate increased efficiency 
in the supply chain, with firms using markets to procure goods and services. Two-sided 
markets facilitate trade between many buyers and many sellers and find application to 
I trading diverse resources, including bandwidth, securities and pollution rights. Recent years 

have also brought increased attention to resource allocation in the context of on-demand 
computing and grid computing. Even within settings of cooperative coordination, such as 
those of multiple robots, researchers have turned to auctions as methods for task allocation 
and joint exploration (?, ?, ?). 

In this paper we consider a dynamic two-sided market for a single commodity, for in- 
stance a unit of a resource (e.g. time on a computer, some quantity of memory chips) or 
a task to perform (e.g. a standard database query to execute, a location to visit). Each 
agent, whether buyer or seller, arrives dynamically and needs to be matched within a time 
interval. Cast as a task-allocation problem, a seller can perform the task when allocated 
within some time interval and incurs a cost when assigned. A buyer has positive value for 
the task being assigned (to any seller) within some time interval. The arrival time, accept- 
able time interval, and value (negative for a seller) for a trade are all private information 
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to an agent. Agents are self-interested and can choose to misrepresent all and any of this 
information to the market in order to obtain a more desirable price. 

The matching problem combines elements of online algorithms and sequential decision 
making with considerations from mechanism design. Unlike traditional sequential decision 
making, a protocol for this problem must provide incentives for agents to report truthful 
information to a match-maker. Unlike traditional mechanism design, this is a dynamic 
problem with agents that arrive and leave over time. We model this problem as a dynamic 
double auction (DA) for identical items. The match-maker becomes the auctioneer. Each 
seller brings a task to be performed during a time window and each buyer brings the 
capability to perform a single task. The double-auction setting also is of interest in its own 
right as a protocol for matching in a dynamic business-to-business exchange. 

Uncertainty about the future coupled with the two-sided nature of the market leads to 
an interesting mechanism design problem. For example, consider the scenario where the 
auctioneer must decide how (and whether) to match a seller with reported cost of $6 at the 
end of its time interval with a present and unmatched buyer, one of which has a reported 
value of $8 and one a reported value of $9. Should the auctioneer pair the higher bidder 
with the seller? What happens if a seller, willing to sell for $4, arrives after the auctioneer 
acts upon the matching decision? How should the matching algorithm be designed so that 
no agent can benefit from misstating its earliest arrival, latest departure, or value for a 
trade? 

Chain provides a general framework that allows a truthful dynamic double auction to 
be constructed from a truthful, single-period (i.e. static) double-auction rule. The auctions 
constructed by Chain are truthful, in the sense that the dominant strategy for an agent, 
whatever the future auction dynamics and bids from other agents, is to report its true 
value for a trade (negative if selling) and true patience (maximal tolerance for trade delay) 
immediately upon arrival into the market. We also allow for randomized mechanisms and, 
in this case, require strong truthfulness: the DA should be truthful for all possible random 
coin flips of the mechanism. One of the DAs in the class of auctions implied by Chain 
is a dynamic generalization of McAfee's (?) canonical truthful, no-deficit auction for a 
single period. Thus, we provide the first examples of truthful, dynamic DAs that allow for 
dynamic price competition between buyers and sellers0 

The main technical challenge presented by dynamic DAs is to provide truthfulness with- 
out incurring a budget deficit, while handling uncertainty about future trade opportunities. 
Of particular concern is to ensure that an agent does not indirectly affect its price through 
the effect of its bid on the prices faced by other agents and thus other supply and demand 
in the market. We need to preclude this because the availability of trades depends on the 
price faced by other agents. For example, a buyer that is required to pay $4 in the DA to 
trade might like to decrease the price that a potentially matching seller will receive from $6 
to $3 to allow for trade. 

Chain is a modular approach to auction design, which takes as a building block a single- 
period matching rule and provides a method to invoke the rule in each of multiple periods 
while also providing for truthfulness. We characterize properties that a well-defined single- 

1. The closest work in the hterature is due to Blum et al. (?), who present a truthful, dynamic DA for our 
model that matches bids and asks based on a price sampled from some bid-independent distribution. 
We compare the performance of our schemes with this scheme in Section [B] 
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period matching rule must satisfy in order for Chain to be truthful. We further identify the 
technical property of strong no-trade, with which we can isolate agents that fail to trade in 
the current period but can nevertheless survive and be eligible to trade in a future period. 
An auction designer defines the strong no-trade predicate, in addition to providing a well- 
defined single-period matching rule. Instances within this class include those constructed 
in terms of both "price-based" matching rules and "competition-based" matching rules. 
Both can depend on history and be adaptive, but only the competition-based rules use the 
active bids and asks to determine the prices in the current period, facilitating a more direct 
competitive processes. 

In proving that Chain, when combined with a well-defined matching rule and a valid 
strong no-trade predicate, is truthful we leverage a recent price-based characterization for 
truthful online mechanisms (?). We also show that the pricing and matching rules defined 
by Chain are unique amongst the family of mechanisms that are constructed with a single- 
period matching rule as a building block. Throughout our work we assume that a constant 
limits every buyer and seller's patience. To motivate this assumption we provide a simple 
environment in which no truthful, no-deficit DA can implement some constant fraction of 
the number of the efficient trades, for any constant. 

We adopt allocative efficiency as our design objective, which is to say auction protocols 
that maximize the expected total value from the sequence of trades. We also consider net 
efficiency, wherein any net outflow of payments to the marketmaker is also accounted for in 
considering the quality of a design. Experimental results explore the allocative efficiency of 
Chain when instantiated to various single-period matching rules and for a range of different 
assumptions about market volatility and maximal patience. For a baseline we consider the 
efficiency of a standard (non-truthful) open outcry DA populated with simple adaptive 
trading agents modeled after "zero-intelligence plus" (ZIP) agents (?, ?). We also compare 
the efficiency of Chain with that of a truthful online DA due to Blum et al. (?), which 
selects a fixed trading price to guarantee competitiveness in an adversarial model. 

From within the truthful mechanisms we find that adaptive, price-based instantiations 
of Chain are the most effective for high arrival intensity and low volatility. Even defining 
a single, well-chosen price that is optimized for the market conditions can be reasonably 
effective in promoting efficient trades in low volatility environments. On the other hand, 
for medium to low arrival intensity and medium to high volatility we find that the Chain- 
based DAs that allow for dynamic price competition, such as the McAfee-based rule, are 
most efficient. The same qualitative observations hold whether one is interested in allocative 
efficiency or net efficiency, although the adaptive, price-based methods have better perfor- 
mance in terms of net efficiency. The Blum et al. (?) rule fairs poorly in our tests, which is 
perhaps unsurprising given that it is optimized for worst-case performance in an adversar- 
ial setting. When populated with ZIP agents, we find that non-truthful DAs can provide 
very good efficiency in low volatility environments but poor performance in high volatility 
environments. The good performance of the ZIP-bascd market occurs when agents learn 
to bid approximately truthfully; i.e., when the market operates as if truthful, but without 
incurring the stringent cost (e.g., through trading constraints) of imposing truthfulness ex- 
plicitly. An equilibrium analysis is available only for the truthful DAs; we have no way of 
knowing how close the ZIP agents arc to playing an equilibrium, and note that the ZIP 
agents do not even consider performing time-based manipulations. 
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1.1 Outline 

Section [2] introduces the dynamic DA model, including our assumptions, and presents 
desiderata for online DAs and a price-based characterization for the design of truthful 
dynamic auctions. Section [3] defines the Chain algorithm together with the building block 
of a well-defined, single-period matching rule and the strong no-trade predicate. Section U] 
gives a number of instantiations to both price-based and competition-based matching rules, 
including a general method to define the strong no-trade predicate given a price-based in- 
stantiation. Section [5] proves truthfulness, no-deficit and feasibility of the Chain auctions 
and also establishes their uniqueness amongst auctions constructed from the same single- 
period matching-rule building block. The importance of the assumption about maximal 
agent patience is established. Section [6] presents our empirical analysis, including a descrip- 
tion of the simple adaptive agents that we use to populate a non-truthful open-outcry DA 
and provide a benchmark. Section [7| gives related work. In Section [8] we conclude with a 
discussion about the merits of truthfulness in markets and present possible extensions. 

2. Preliminaries: Basic Definitions 

Consider a dynamic auction model with discrete, possibly infinite, time periods T = 
{1, 2, . . .}, indexed by t. The double auction (DA) provides a market for a single commodity. 
Agents are either buyers or sellers interested in trading a single unit of the commodity. An 
agent's type, 9i = {ai,di,Wi) G Gj, where Qi is the set of possible types for agent i, defines 
an arrival aj, departure di, and value G M for trade. If the agent is a buyer, then Wi > 0. 
If the agent is a seller, then Wi < 0. We assume a maximal patience K, so that di < ai + K 
for all agents. 

The arrival time models the first time at which an agent learns about the market or 
learns about its value for a trade. Thus, information about its type is not available before 
period aj (not even to agent i) and the agent cannot engage in trade before period a^. The 
departure time, di, models the final period in which a buyer has positive value for a trade, 
or the final period in which a seller is willing to engage in trade. We model risk-neutral 
agents with quasi-linear utility, Wi — p when a trade occurs in t G [ai,c?i] and payment p 
is collected (with p < if the agent is a seller). Agents are rational and self-interested, 
and act to maximize expected utility. By assumption, sellers have no utility for payments 
received after their true departure period. 

Throughout this paper we adopt bid to refer, generically, to a claim that an agent - 
either a buyer or a seller - makes to a DA about its type. In addition, when we need to be 
specific about the distinction between claims made by buyers and claims made by sellers 
we refer to the bid from a buyer and the ask from a seller. 

2.1 Example 

Consider the following naive generalization of the (static) trade-reduction DA (?, ?) to 
this dynamic environment. A bid from an agent is a claim about its type 9i = {di,di,Wi), 
necessarily made in period t = hi. Bids are active while t G [aj,di] and no trade has 
occurred. 

Then in each period t, use the trade-reduction DA to determine which (if any) of the 
active bids trade and at what price. These trades occur immediately. The trade-reduction 
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DA (tr-DA) works as follows: Let B denote the set of bids and S denote the set of asks. 
Insert a dummy bid with value +00 into B and a dummy ask with value into S. When 
> 2 and l^*! > 2 then sort B and S in order of decreasing value. Let Wb^ > Wb^ > ■ ■ ■ and 
Wso > ^si > ■ ■ ■ denote the bid and ask values with {bo,so) denoting the dummy bid-ask 
pair. Let m > index the last pair of bids and asks to clear in the efficient trade, such that 
^brn + '"^sm > Wb^_^_-^ + ^s^+i < 0. Wheu TTT, > 2 then bids {61, ... , bm-i} and asks 
{si, . . . , Sm-i} trade and payment is collected from each winning buyer and payment 
—Wg^ is made to each winning seller. 

First consider a static tr-DA with the following bids and asks: 
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bl 


15 




-1 




10 




-1 


h* 
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^3 


-2 


b4 
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S4 
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-5 



The line indicates that bids (1-4) and asks (1-4) could be matched for efficient trade. 
By the rules of the tr-DA, bids (1-3) and asks (1-3) trade, with payments $3 collected from 
winning buyers and payment $2 made to winning sellers. The auctioneer earns a profit of 
$3. The asterisk notation indicates the bids and asks that trade. The tr-DA is truthful, in 
the sense that it is a dominant-strategy for every agent to report its true value whatever 
the reports of other agents. For intuition, consider the buy-side. The payment made by 
winners is independent of their bid price while the losing bidder could only win by bidding 
more than $4, at which point his payment would be $4 and more than his true value. 

Now consider a dynamic variation with buyer types {(1, 2, 15), (1, 2, 10), (1, 2, 4), (2, 2, 3)} 
and seller types {(1, 2, -1), (2, 2, -1), (1, 1, -2), (2, 2, -2), (1, 2, -5)}. When agents are truth- 
ful, the dynamic tr-DA plays out as follows: 
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In period 1, buyer 1 and seller 1 trade at payments of $10 and $2 respectively. In 
period 2, buyer 2 and seller 2 trade at payments of $4 and $2 respectively. But now we can 
construct two kinds of manipulation to show that this dynamic DA is not truthful. First, 
buyer 1 can do better by delaying his reported arrival until period 2: 
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Now, buyer 2 trades in period 1 and does not set the price to buyer 1 in period 2. Instead, 
buyer 1 now trades in period 2 and makes payment $4. 

Second, buyer 3 can do better by increasing his reported value: 

period 1 period 2 
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Now, buyers 1 and 2 both trade in period 1 and this allows buyer 3 to win (at a price 
below his true value) in period 2. This is a particularly interesting manipulation because 
the agent's manipulation is by increasing its bid above its true value. By doing so, it allows 
more trades to occur and makes the auction less competitive in the next period. 

2.2 Dynamic Double Auctions: Desiderata 

We consider only direct-revelation, dynamic DAs that restrict the message that an agent 
can send to the auctioneer to a single, direct claim about its type. We also consider "closed" 
auctions so that an agent receives no feedback before reporting its type and cannot condition 
its strategy on the report of another agentJl 

Given this, let 0* denote the set of agent types reported in period t, 9 = {9^ ,6'^ , . . . ,9^, . . . ,) 
denote a complete type profile (perhaps unbounded), and 9-* denote the type profile re- 
stricted to agents with (reported) arrival no later than period t. A report 9i = {ai,di,Wi) 
represents a commitment to buy (sell) one unit of the commodity in any period t € [aj,(ij] 
for a payment of at most Wi. Thus, if a seller reports a departure time di > di, it must 
commit to complete a trade that occurs after her true departure and even though a seller 
is modeled as having no utility for payments received after her true departure. 

A dynamic DA, M = (vr, x), defines an allocation policy vr = {vr*}*^^ and payment 
policy X = {x*}*^^, where 7r*(^-*) G {0, 1} indicates whether or not agent i trades in period 
t given reports 9-^, and x\{9-^) G M indicates a payment made by agent negative if this is 
a payment received by the agent. The auction rules can also be stochastic, so that 7r*(0-*) 
and x\{9-^) are random variables. For a dynamic DA to be well defined, it must hold that 
'k\{9-*) = 1 in at most one period t € [cj, di] and zero otherwise, and the payment collected 
from agent i is zero except in periods t G [ai,di\. 

In formalizing the desiderata for dynamic DAs, it will be convenient to adopt {'k{9),x{9)) 
to denote the complete sequence of allocation decisions given reports 9, with shorthand 
T^i{9) G {0, 1} and Xi[9) G M to indicate whether agent i trades during its reported arrival- 
departure interval, and the total payment made by agent i, respectively. By a slight abuse 
of notation, we write i G 9-* to denote that agent i reported a type no later than period t. 
Let B denote the set of buyers and S denote the set of sellers. 



2. The restriction to direct-revelation, online mechanisms is without loss of generality when combined with 
a simple heart-beat message from an agent to indicate its presence in any period t during its reported 
arrival-departure interval. See the work of Pai and Vohra (?) and Parkes (?). 
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We shall require that the dynamic DA satisfies no-deficit, feasibility, individual-rationality 
and truthfulness. No-deficit ensures that the auctioneer has a cash surplus in every period: 

Definition 1 (no-deficit) A dynamic DA, M = (tt, x) is no-deficit if: 

J2 E ^f(^-*')>0' vt,ve (1) 

Feasibility ensures that the auctioneer does not need to take a short position in the 
commodity traded in the market in any period: 

Definition 2 (feasible trade) A dynamic DA, M = (vr, x) is feasible if: 

E E -^(^-')- E E -f(^-'')>o, yt,y9 (2) 

ie9i-t,ieSt'e[ai ,uiin{t,di)] iee^t,ieBt'e[ai ,min(t,dj )] 

This definition of feasible trade assumes that the auctioneer can "hold" an item that 
is matched between a seller-buyer pair, for instance only releasing it to the buyer upon 
his reported departure. See the remark concluding this section for a discussion of this 
assumption. 

Let Vi{9i,Tr{9'-,0-i)) G R denote the value of an agent with type 9i for the allocation 
decision made by policy vr given report {9'-,9^i), i.e. Vi{9i,T^{9[,9-i)) = Wi if the agent 
trades in period t € [oi, di] and if it trades outside of this interval and is a buyer, or — oo if 
it trades outside of this interval and is a seller. Individual-rationality requires that agent i's 
utility is non-negative when it reports its true type, whatever the reports of other agents: 

Definition 3 (individual-rational) A dynamic DA, M = (7r,x) is individual-rational 
(IR) if Vi{9i,TT{9)) - Xi{9) > for all i, all 9. 

In order to define truthfulness, we introduce notation C{9i) C 0j for 9i G Qi to denote 

the set of available misreports to an agent with true type 9i. In the standard model adopted 
in offline mechanism design, it is typical to assume C{9i) = Qi with all misreports available. 
Here, we shall assume no early-arrival misreports, with C{9i) = {9i = (aj, di,Wi) : ai < en < 
di}. This assumption of limited misreports is adopted in earlier work on online mechanism 
design (?), and is well-motivated when the arrival time is the first period in which a buyer 
first decides to acquire an item or the period in which a seller first decides to sell an item. 

Definition 4 (truthful) Dynamic DA, M = (vr, x), is dominant- strategy incentive- 
compatible, or truthful, given limited misreports C if: 

v,{9„ 7t{9„9'_,)) - x,{9i, 9'_i) > v,{9i,7Ti9i,9'_,)) - Xi{9i, 9'_,). 

for all 9i G C{9i), all 9i, all 9'_i G C{9-i), all 9-i G 6-^. 

This is a robust equilibrium concept: an agent maximizes its utility by reporting its 
true type whatever the reports of other agents. Truthfulness is useful because it simplifies 
the decision problem facing bidders: an agent can determine its optimal bidding strategy 
without a model of either the auction dynamics or the other agents. In the case that 
the allocation and payment policy is stochastic, then we adopt the requirement of strong 
truthfulness so that an agent maximizes its utility whatever the random sequence of coin 
flips within the auction. 
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Remark. The flexible definition of feasibility, in which the auctioneer is able to take a 
long position in the commodity, allows the auctioneer to time trades by receiving the unit 
sold by a seller in one period but only releasing it to a buyer in a later period. This 
allows for truthfulness in environments in which bidders can overstate their departure pe- 
riod. In some settings this is an unreasonable requirement, however, for instance when 
the commodity represents a task that is performed, or because a physical good is being 
traded in an electronic market H In these cases, the definition of feasibility strengthened 
to require exact trade-balance in every period. The tradeoff is that available misreports 
must be further restricted, with agents limited to reporting no late- departures in addition 
to no early -arrivals (?, ?). For the rest of the paper we work in the "relaxed feasibility, no 
early- arrival" model. The Chain framework can be immediately extended to the "strong- 
feasibility, no early-arrival and no late-departure" model by executing trades immediately 
rather than delaying the trade until a buyer's departure. 

3. Chain: A Framework for Truthful Dynamic DAs 

Chain provides a general algorithmic framework with which to construct truthful dynamic 
DAs from well-defined single-period matching rules, such as the tr-DA rules described in 
the earlier section. 

Before introducing Chain we need a few more definitions: Bids reported to Chain are 
active while t < di (for reported departure period di), and while the bid is unmatched 
and still eligible to be matched. In each period, a single-period matching rule is used to 
determine whether any of the active bids will trade and also which (if any) of the bids that 
do not match will remain active in the next period. 

Now we define the building blocks, well-defined single-period matching rules, and intro- 
duce the important concept of a strong no-trade predicate, which is defined for a single- 
period matching rule. 

3.1 Building Block: A Single- Period Matching Rule 

In defining a matching rule, it is helpful to adopt 6* G &iid s* € M<g to denote the 
active bids and active asks in period t, where there are m > and n > bids and asks 
respectively. The bids and asks that were active in earlier periods but are no longer active 
form the history in period t, denoted € where /i > is the size of the history. 

A single-period matching rule (hereafter a matching rule), Mmr = {'^mr,Xmr) defines 
an allocation rule 'J^rnr 

{H^,b^,s^,oo) G {0,1}^"^^^^ and a payment rule Xmr{H^ , ^ 
j^(m+n)^ Here, we include random event cj G to allow explicitly for stochastic matching 
and allocation rules. 

Definition 5 (well-defined matching rule) A matching rule Mmr = {'^mr,Xmr) is well- 
defined when it is strongly truthful, no-deficit, individual-rational, and strong-feasible. 

Here, the properties of truthfulness, no-deficit, and individual-rationality are exactly 
the single-period specializations of those defined in the previous section. For instance, 

3. Note that if the task is a computational task, then tasks can be handled within this model by requiring 
that the seller performs the task when it is matched but with a commitment to hold onto the result until 
the matched buyer is ready to depart. 
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function SiMPLEMATCH(i?*,6*,s*) 
matched := 
J)* := mean(|iJ*|) 
while (&* =^ 0)&(s* ^ 0) do 

i := 0, h := -e, j := 0, := -cxo 
while (&, < p*)&(6* 7^ 0) do 

i := random(&*), 6* := &* \ {i} 
end while 

while (sj < -p*)&(s* 7^ 0) do 

j := randoni(s*), s* := s* \ {j} 
end while 

if {i ^ 0)&(j ^ 0) then 

matched :— matched U {(«, j)} 
end if 
end while 
end function 

Figure 1: A well-defined matching rule defined in terms of the mean bid price in the history. 

a matching rule is truthful in this sense when the dominant strategy for an agent in a 
DA defined with this rule, and in a static environment, is to bid truthfully and for all 
possible random events uj. Similarly for individual-rationality. No-deficit requires that the 
total payments are always non-negative. Strong-feasibility requires that exactly the same 
number of asks are accepted as bids, again for all random events. 

One example of a well-defined matching rule is the tr-DA, which is invariant to the 
history of bids and asks. For an example of a well-defined, adaptive (history-dependent) 
and price-based matching rule, consider procedure SimpleMatch in Figure [TJ The SlM- 
pleMatch matching rule computes the mean of the absolute value of the bids and asks in 
the history and adopts this as the clearing price in the current period. It is a stochastic 
matching rule because bids and asks are picked from the sets 6* and s* at random and 
offered the price. We can reason about the properties of SimpleMatch as follows: 

(a) truthful: the price is independent of the bids and the probability that a bid (or 
ask) is matched is independent of its bid (or ask) price 

(b) no-deficit: payment is collected from each matched buyer and made to each 
matched seller 

(c) individual-rational: only bids bi > and asks Sj > — are accepted. 

(d) feasible: bids and asks are introduced to the "matched" set in balanced pairs 

3.2 Reasoning about Trade (Im)Possibility 

In addition to defining a matching rule M^r, we allow a designer to (optionally) designate 
a subset of losing bids that satisfy a property of strong no-trade. Bids that satisfy strong 
no-trade are losing bids for which trade was not possible at any bid price (c.f. ask price 
for asks), and moreover for which additional independence conditions hold between bids 
provided with this designation. 

We first define the weaker concept of no-trade. In the following, notation 
7rmr,i{H*' , julwi) indicates the allocation decision made for bid (or ask) i when its bid 
(ask) price is replaced with Wi: 
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Definition 6 (no-trade) Given matching rule Mmr = [T^mriXmr) then the set of agents, 
NT*, for which no trade is possible in period t and given random events uj are those for 
which 7rmr,i{H^ , , , ^{wi) = 0? for every Wi G M>o when i € and for every Wi € M<o 
when i & s*. 

It can easily happen that no trade is possible, for instance when the agent is a buyer 
and there are no sellers on the other side of the market. Let SNT* C NT* denote the set 
of agents designated with the property of strong no-trade. Unlike the no-trade property, 
strong no-trade need not be uniquely defined for a matching rule. To be valid, however, the 
construction offered by a designer for strong no-trade must satisfy the following: 

Definition 7 (strong no-trade) A construction for strong no-trade, SNT* C NT*, is 
valid for a matching rule when: 

(a) yi £ NT* with di > t, whether or not i G SNT* is unchanged for all alternate reports 

e[ = {a[,d[,w[) + Oi while d[ > t, 

(b) Mi G SNT* with di > t, the set {j : j € SNT*,j 7^ i,dj > t} is unchanged for all 
reports 9[ = (a^, d[.,w[) 7^ 9i while d[ > t, and independent even of whether or not agent i is 
present in the market. 

The strong no-trade conditions must be checked only for agents with a reported depar- 
ture later than the current period. Condition (a) requires that such an agent in NT* cannot 
affect whether or not it satisfies the strong no-trade predicate as long as it continues to 
report a departure later than the current period. Condition (b) is defined recursively, and 
requires that if such an agent is identified as satisfying strong no-trade, then its own report 
must not affect the designation of strong no-trade to other agents, with reported departure 
later than the current period, while it continues to report a departure later than the current 
period - even if it delays its reported arrival until a later period. 

Strong no-trade allows for flexibility in determining whether or not a bid is eligible for 
matching. Specifically, only those bids that satisfy strong no-trade amongst those that lose 
in the current period can remain as a candidate for trade in a future period. The property 
is defined to ensure that such a "surviving" agent does not, and could not, affect the set of 
other agents against which it competes in future periods. 



Example 1 Consider the tr-DA matching rule defined earlier with bids and asks 
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Bid 1 and ask 1 trade at price 8 and —6 respectively. NT* = because bids 2 and 3 could 
each trade if they had (unilaterally) submitted a bid price of greater than 10. Similarly for 
asks 2 and 3. Now consider the order book 
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No trade occurs. In this case, NT* = {61,62,63,51}. No trade is possible for any bids, even 
bids 2 and 3, because w^^ +?i's2 = 8 — 10 < 0. But, trade is possible for asks 2 and 3, because 
Wb2 + = 7 — 6 > and either ask could trade by submitting a low enough ask price. 

Example 2 Consider the tr-DA matching rule and explore possible alternative construc- 
tions for strong no-trade. 

(i) Dictatorial: in each period t, identify an agent that could be present in the period in 
a way that is oblivious to all agent reports. Let i denote the index of this agent. If i & NT*, 
then include SNT* = {i}. Strong no-trade condition (a) is satisfied because whether or not i 
is selected as the "dictator" is agent-independent, and given that it is selected, then whether 
or not trade is possible is agent-independent. Condition (b) is trivially satisfied because 
I SNT* I = 1 and there is no cross-agent coupling to consider. 

(a) SNT* := NT*. Consider the order book 
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Suppose all bids and asks remain in the market for at least one more period. Clearly, 
NT* = {61,62763,81,82,53}. Consider the candidate construction SNT* = NT*. Strong no- 
trade condition (a) is satisfied because whether or not i is in set NT* is agent-independent. 
Condition (b) is not satisfied, however. Consider bid 2. If bid 2's report had been 8 instead 
of 2 then trade would be possible for bids 1 and 3, and SNT* = NT* = {62, si, S2, S3}. Thus, 
whether or not bids 1 and 3 satisfy the strong no-trade predicate depends on the value of bid 
2. This is not a valid construction for strong no-trade for the tr-DA matching rule. 

(Hi) SNT* = NT* if |6*| < 2 or |s*| < 2, and SNT* = otherwise. As above, strong 
no-trade condition (a) is immediately satisfied. Moreover, condition (b) is now satisfied 
because trade is not possible for any bid or ask irrespective of bid values because there are 
simply not enough bids or asks to allow for trade with tr-DA (which needs at least 2 bids 
and at least 2 asks). 
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Figure 2: The decision process in Chain upon arrival of a new bid. If admitted, then the bid 
participates in a sequence of matching events while it remains unmatched and in the 
strong no-trade set. The bid matches at the first available opportunity and is priced 
immediately. 

Example 3 Consider a variant of the SimpleMatch matching rule, defined with fixed 
price 9. We can again ask whether SNT* := NT* is a valid construction for strong no-trade. 
Throughout this example suppose all bids and asks remain in the market for at least one 
more period. First consider a bid with = 8 and two asks with values Wsi = —6 and 
— ■ Here, NT* = {si,S2} because the asks cannot trade whatever their price since 
the bid is not high enough to meet the fixed trading price of 9. Moreover, SNT* = {si,S2} 
is a valid construction; strong no-trade condition (a) is satisfied as above and condition (b) 
is satisfied because whether or not ask 2 is in NT* ( and thus SNT* j is independent of the 
price on ask 1, and vice versa. But consider instead a bid with ^t)^,^ = 8 and an ask with 
Wg^ = —10. Now, NT* = {6i,si} and SNT* = {6i,si} is our candidate strong no-trade set. 
However if bid 1 had declared value 10 instead of 8 then NT* = {bi} and ask 1 drops out of 
SNT*. Thus, strong no-trade condition (b) is not satisfied. 

We see from the above examples that it can be quite delicate to provide a valid, non- 
trivial construction of strong no-trade. Note, however, that SNT* = is a (trivial) valid 
construction for any matching rule. Note also that the strong no-trade conditions (a) and 
(b) require information about the reported departure period of a bid. Thus, while the 
matching rules do not use temporal information about bids, this information is used in the 
construction for strong no-trade. 

3.3 Chain: From Matching Rules to Truthful, Dynamic DAs 

The control flow in Chain is illustrated in Figure [21 Upon arrival of a new bid, an admission 
decision is made and bid i is admitted if its value wi is at least its admission price qi. An 
admitted bid competes in a sequence of matching events, where a matching event simply 
applies the matching rule to the set of active bids and asks. If a bid fails to match in some 
period and is not in the strong no-trade set {i ^ SNT*), then it is priced out and leaves the 
market without trading. Otherwise, if it is still before its departure time (t < dj), then it is 
available for matching in the next period. 

Each bid is always in one of three states: active, matched or priced- out. Bids are active 
if they are admitted to the market until t < di, or until they are matched or priced-out. An 
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active bid becomes matched in the first period (if any) when it trades in the single-period 
matching rule. An active bid is marked as priced-out in the first period in which it loses 
but is not in the strong no-trade set. As soon as a bid is no longer active, it enters the 
history, , and the information about its bid price can be used in defining matching rules 
for future periods. 

Let denote the set of bids that will expire in the current period. A well-defined 
matching rule, when coupled with a valid strong no-trade construction, must provide Chain 
with the following information, given history H*, active bids 6* and active asks s*, and 
expiration set in period t: 

(a) for each bid or ask, whether it wins or loses 

(b) for each winning bid or ask, the payment collected (negative for an ask) 

(c) for each losing bid or ask, whether or not it satisfies the strong no-trade condition 

Note that the expiration set E*' is only used for the strong no-trade construction. This 
information is not made available to the matching rule. The following table summarizes the 
use of this information within Chain. Note that a winning bid cannot be in set SNT*: 





-.SNT* 


SNT* 


Lose 


priced-out 


survive 


Win 


matched 


n/a 



We describe Chain by defining the events that occur for a bid upon its arrival into the 
market, and then in each period in which it remains active: 

Upon arrival: Consider all possible earlier arrival periods t' G [di — K,ai — 1] consistent 
with the reported type. There are no such periods to consider if the bid is maximally 
patient. If the bid would lose and not be in SNT* for any one of these arrival periods 
t', then it is not admitted. Otherwise, the bid would win in all periods t' for which 
i ^ SNT* , and define the admission price as: 

q{ai,di,e-i,uj) := max [pf,-oo], (3) 

t'e[di-K,ai-l],i^SNT* 

where p*' is the payment the agent would have made (negative for a seller) in arrival 
period t' (as determined by running the myopic matching rule in that period). When 
the agent would lose in all earlier arrival periods t' (and so i G SNT* for all t'), or 
the bid is maximally patient, then the admission price defaults to — oo and the bid is 
admitted. 

While active: Consider period i G [oj, di]. If the bid is selected to trade by the myopic 
matching rule, then mark it as matched and define final payment: 

= niax{q{di,di,e-i,u}),p\), (4) 

where p^ is the price (negative for a seller) determined by the myopic matching rule in 
the current period. If this is a buyer, then collect the payment but delay transferring 
the item until period di. If this is a seller, then collect the item but delay making 
the payment until the reported departure period. If the bid loses and is not in SNT*, 
then mark the bid as priced- out. 
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We illustrate Chain by instantiating it to various matching rules in the next section. 
In Section [5] we prove that Chain is strongly truthful and no-deficit when coupled with a 
well-defined matching rule and a valid strong no-trade construction. We will see that the 
delay in buyer delivery and seller payment ensures truthful revelation of a trader's departure 
information. For instance, in the absence of this delay, a buyer might be able to do better by 
over-reporting departure information, still trading early enough but now for a lower price. 

3.4 Comments 

We choose not to allow the single-period matching rules to use the reported arrival and 
departure associated with active bids and asks. This maintains a clean separation between 
non-temporal considerations (in the matching rules) and temporal considerations (in the 
wider framework of Chain). This is also for simplicity. The single-period matching rules 
can be allowed to depend on the reported arrival-departure interval, as long as the (single- 
period) rules are monotonic in tighter arrival-departure intervals, in the sense that an agent 
that wins for some 6i = {ai,di,Wi) continues to win and for an improved price if it instead 
reports {a'-,d'-,Wi) with C [di,di]. However, whether or not trade is possible must 

be independent of the reported arrival-departure interval and similarly for strong no-trade. 
Determinations such as these would need to be made with respect to the most patient type 
{di - K,di,Wi) given report 0i = {di,di,Wi). 

4. Practical Instantiations: Price-Based and Competition-Based Rules 

In this section we offer a number of instantiations of the Chain online DA framework. We 
present two different classes of well-defined matching rules: those that are price-based and 
compute simple price statistics based on the history which are then used for matching, and 
those that we refer to as competition-based and leverage the history but also consider direct 
competition between the active bids and asks in any period. In each case, we establish that 
the matching rules are well-defined and provide a valid strong no-trade construction. 

4.1 Price-Based Matching Rules 

Each one of these rules constructs a single price, p*, in period t based on the history of 
earlier bids and asks that are traded or expired. For this purpose we define variations on a 
real valued statistic, (,{H^), that is used to define this price given the history. Generalizing 
the SimpleMatch procedure, as introduced in Section [3Tl the price is used to determine 
the trades in period t. We also provide a construction for strong no-trade in this context. 

The main concern in setting prices is that they may be too volatile, with price updates 
driving the admission price higher (via the max operator in the admission rule of Chain) 
and having the effect of pricing bids and asks out of the market. We describe various forms 
of smoothing and windowing, all designed to provide adaptivity while dampening short- 
term variations. In each case, the parameters (e.g. the smoothing factor, or the window 
size) can be determined empirically through off-line tuning. 

We experiment with five price variants: 

History-EWMA: Exponentially-weighted moving average. The bid history, H^, is used 
to define price in period t, computed as := A £,{H^) + (1 — A)p*~^, where A G (0, 1] is a 
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smoothing constant and ^(i?*) is a statistic defined for bids and asks that enter the history 
in period t. Experimentally we find that the mean statistic, ^"^'''^■^(i?*), of the absolute 
values of bids and asks that enter the history performs well with A of 0.05 or lower for most 
scenarios that we test. For cases in which ^(-ff*) is not well-defined because of too few (or 
zero) new bids or asks, then we set := 

History-median: Compute price from a statistic over a fixed-size window of the most 
recent history, := ^(i^*,A) where A is the window-size, i.e. defining bids introduced 
to history in periods [t — A, . . . ,t]. Experimentally, we find that the median statistic, 
^median ^ of the absolute bid and ask values performs well for the scenarios we test, 
with the window size depending inversely with the volatility of agents' valuations. Typically, 
we observe optimal window sizes of 20 and 150, depending on volatility. For cases in which 
^(-ff*,A) is not well-defined because of too few (or zero) new bids or asks, then we set 

History-clearing: Identical to the history-median rule except the statistic ^(-fr*,A) is 
defined as (6^ — •Sm)/2 where hm and Sm are the lowest value pair of trades that would be 
executed in the efficient (value-maximizing) trade given all bids and asks to enter history 
i7* in periods [t — A, . . . Empirically, we find similar optimal window sizes for history- 
clearing as for history-median. 

History-McAfee: Define the statistic S,{H^, A) to represent the McAfee price, defined in 
Section [4.21 for the bids in had they all simultaneously arrived. 

Fixed price: This simple rule computes a single fixed price := p* for all trading periods, 
with the price optimized offline to maximize the average-case efflciency of the dynamic DA 
given Chain and the associated single-period matching rule that leverages price p* as the 
candidate trading price. 

For each pricing variant, procedure MATCH (see Figures [3H11) is used to determine which 
bids win (at price p*), which lose, and, of those that lose, which satisfy the strong no- 
trade predicate. The subroutine used to determine the current price is referred to as 
determineprice in MATCH. We provide as input to MATCH the set i?* in addition to 
(i/*,6*,s*) because MATCH also constructs the strong no-trade set, and is used exclu- 
sively for this purpose. 

The proof of the following lemma is technical and is postponed until the Appendix. 

Lemma 1 Procedure Match defines a valid strong no-trade construction. 

Theorem 1 Procedure Match defines a well-defined matching rule and a valid strong no- 
trade construction. 

Proof: No-deficit, feasibility, and individual-rationality are immediate by the construction 
of Match since bids and asks are added to matched in pairs, with the same payment, and 
only if the payment is less than or equal to their value. Truthfulness is also easy to see: the 
order with which a bid (or ask) is selected is independent of its bid price, and the price it 
faces, when selected, is independent of its bid. If the price is less than or equal to its bid, 
then whether or not it trades depends only on its order. The rest of the claim follows from 
Lemma [H □ 
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function MATCH(iJ*,6*,s*,^*) 

matched := 0, lose := 0, NT* := 0, SNT* := 
stop := false 

:= determineprice(i7*) 
while -1 stop do 

i := 0, j := 0, checked^ := 0, checked^ := 
while ((checkedij C b*)k{i=0)) V ((checkeds C s*)&(j=0)) do 
if (i = 0)&(j = 0) then 

k := random(6*\ checkedBUs*\ checked^) 
else if {i = 0) then 

k := random ( 6* \ checked^) 
else if (j = 0) then 

k := random(s*\ checkeds) 
end if 

if (A; G 6*) then 

checked^ := checked^ U {/c} 

if {bk > P*) then 
i := k 

end if 
else 

checked^ := checkeds U {fc} 
if (sfc > — P*) then 

j := k 
end if 
end if 
end while 

if {i + 0)&(j + 0) then 

matched := matched |J {(^,i)} 
lose := lose (J (checked^ \ {i}) U (checkeds \ {j}) 
6* := 5*\ checked^, s* := s*\ checked^ 
else 

stop := true 
end if 
end while 
end function 

Figure 3: The procedure used for single-period matching in applying Chain to the price- 
based rules. The algorithm continues in Figured! 
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function Match {continued) {H^ ,b* ,s* ,E'^) 

if {i 0)&(j = 0) then > I 

lose := lose \]s\ NT* := 6* 

if (3A; G 6* • > = t))) V (VA; G s* • (4 = t)) then 

SNT* := 6* 

else > I-a 

SNT* ■=h\ checkedB 
end if 

else if (j / 0)&(« = 0) then > II 

lose := lose U^**, NT* := s* 

if (3fc G s* • ((sfc > -p*)&(4 = t))) V (VA; G 6* • (4 = i)) then 

SNT* := s* 
else 

SNT* := s*\ checked^ 
end if 

else if {i = 0)&(j = 0) then > III 

NT* := 6* U s* 

if (VA; G 6* • (4 = t)) V (VA; G s* • (4 = t)) then 

SNT* := 6* U s* 
end if 
end if 
end function 

Figure 4: Continuing procedure from Figure [3] for single-period matching in applying 
Chain to the price-based rules. 



Example 4 (i) Bid 6* = {8}, ask s* = {—6}, indexed {1,2} and price p* = 9. The outer 
while loop in Figure terminates with j = 2 and i = in Case II. The bid is marked as 
a loser while NT* = {2}. // the bid will depart immediately, then SNT* = {2}, otherwise 
SNT* = 0. 

(a) Bid 6* = {8}, asks s* = {—6, —7}, indexed {1,2,3}, and price p* = 9. Suppose that 
ask 2 is selected before ask 3 in the outer while loop. Then the loop terminates with j = 2 
and i = in Case II and NT* = {2,3}. Suppose the bid and asks leave the market later 
than this period. Then SNT* = {3} because checkeds = {2}. 

(Hi) Bid 6* = {8} and ask s* = {—10}, indexed {1,2}, price p* = 9 and both the bid and 
the ask is patient. The outer while loop terminates with i = and j = in Case III so 
that NT* = {1,2}. However, SNT* = 0. 



4.2 Competition-Based Matching Rules 

Each one of these rules determines which bids match in the current period through price 
competition between the active bids. We present three variations: McAfee, Windowed- 
McAfee and Active-McAfee. The latter two rules are hybrid rules in that they leverage 
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history of past offers, in smoothing prices generated by the competition-based matching 
rules. 

McAfee: Use the static DA protocol due to McAfee as the matching rule. Let B denote 
the set of bids and S denote the set of asks. If min([i?|, \S\) < 2, then there is no trade. 
Otherwise, first insert two dummy bids with value {oo, 0} and two dummy asks with value 
{0, — oo} into the set of bids and asks. Let bo > bi > . . . > bm and sq > si > . . . > 
Sn ■ ■ ■ denote the bid and ask values with (6o, sq) denoting dummy pair (oo, 0) and {bm, s„) 
denoting dummy pair (0, — oo) and ties otherwise broken at random. Let m > index 
the last pair of bids and asks to clear in the efficient trade, such that bm + Sm ^ and 
6^+1 + Sm+i < 0. When m> 1, consider the following two cases: 

• (Case I) If price Pm+i = < 5^ and —pm+i < Sm then the first m bids and 
asks trade and payment Pm+i is collected from each winning buyer and made to each 
winning seller. 

• (Case II) Otherwise, the first m — 1 bids and asks trade and payment bm is collected 
from each winning buyer and payment — Srn is made to each winning seller. 

To define NT*, replace a bid that does not trade with a bid reporting a very large value 
and see whether this bid trades. To determine whether trade is possible for an ask that 
does not trade: replace the ask with an ask reporting value e > 0, some small e. Say 
that there is a quorum if and only if there are at least two bids and at least two asks, i.e. 
min(|6*|, |s*|) > 2. Define strong no-trade as follows: set SNT* := NT* = 6* U s* when 
there is no quorum and SNT* := otherwise. 

Lemma 2 For any bid bi in the McAfee matching rule, then for any other bid (or ask) j 
there is some bid bi that will make trade possible for bid (or ask) j when there is a quorum. 

Proof: Without loss of generality, suppose there are three bids and three asks. Label the 
bids (a, c, e) and the asks {b,d,f), both ordered from highest to lowest so that (a, 6) is the 
most competitive bid-ask pair. Proceed by case analysis on bids. The analysis is symmetric 
for asks and omitted. Let tp{i) G {0, 1} denote whether or not trade is possible for bid i, so 
that i G NT* <^ tp{i) = 0. For bid a: when b > -(a - d)/2 then tp{c) = tp{e) = 1 and this 
inequality can always be satisfied for a large enough a; when a > {c — d)/2 then tp{b) = 1 
and when a> (c — b)/2 then tp(d) = tp{f) = 1, and both of these inequalities are satisfied 
for a large enough a. For bid c: when b > —{c — d)/2 then tp{a) = 1 and when, in addition, 
c > a, then tp{e) = 1 and each one of these inequalities are satisfied for a large enough c; 
similarly when c > (a — d)/2 then tp{b) = 1 and when c > {a — b)/2 then tp{d) = tp{f) = 1. 
Analysis for bid e follows from that for bid c. □ 

Lemma 3 The construction for strong no-trade is valid and there is no valid strong no- 
trade construction that includes more than one losing bid or ask that will not depart in the 
current period for any period in which there is a quorum. 

Proof: To see that this is a valid construction, notice that strong no-trade condition (a) 
holds since any bid (or ask) is always in both NT* and SNT*. Similarly, condition (b) 
trivially holds (with the other bids and asks remaining in SNT* even if any bid is not 
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present in the market). To see that this definition is essentially maximal, consider now that 
min(|6*|, |s*|) > 2. For contradiction, suppose that two losing bids {i,j} with departure 
after the current period are designated as strong no-trade. But, strong no-trade condition 
(b) fails because of Lemma[2]because either bid could have submitted an alternate bid price 
that would remove the other bid from NT* and thus necessarily also from SNT*. □ 
The construction offered for SNT* cannot be extended even to include one agent selected 
at random from the set i £ NT* that will not depart immediately, in the case of a quorum. 
Such a construction would fail strong no-trade condition (b) when the set NT* contains 
more than one bid (or ask) that does not depart in the current period, because bid i's 
absence from the market would cause some other agent to be (randomly) selected as SNT*. 

Windowed-McAfee: This myopic matching rule is parameterized on window size A. 
Augment the active bids and asks with the bids and asks introduced to the history in 
periods t' € {t — A + 1, . . . ,t}. Run McAfee with this augmented set of bids and asks and 
determine which of these bids and asks would trade. Denote this candidate set C. Some 
active agents identified as matching in C may not be able to trade in this period because 
C can also contain non- active agents. 

Let B' and S' denote, respectively, the active bids and active asks in set C. Windowed- 
McAfee then proceeds by picking a random subset of min(|i?'|, |5'|) bids and asks to trade. 
When \B'\ ^ \S'\, then some bids and asks will not trade. 

Define strong no-trade for this matching rule as: 

(i) if there are no active asks but active bids, then SNT* := 6* 

(ii) if there are no active bids but active asks, then SNT* := s* 

(iii) if there are fewer than 2 asks or fewer than 2 bids in the augmented bid set, then 
SNT* := 6* U s*, 

and otherwise set SNT* := 0. In all cases it should be clear that SNT* C NT*. 

Lemma 4 The strong no-trade construction for windowed-McAfee is valid. 

Proof: That this is a valid SNT criteria in case (iii) follows immediately from the validity 
of the SNT criteria for the standard McAfee matching rule. Consider case (i). Case (ii) 
is symmetric and omitted. For strong no-trade condition (a), we see that all bids i E NT* 
and also i € SNT*, and whether or not they are designated strong no-trade is independent 
of their own bid price but simply because there are no active asks. Similarly, for strong 
no-trade condition (b), we see that all bids (and never any asks) are in SNT* whatever the 
bid price of any particular bid (and even whether or not it is present). □ 
Empirically, we find that the efficiency of Windowed-McAfee is sensitive to the size of 
i^* , but that frequently the best choice is a small window size that includes only the active 
bids. 

Active-McAfee: Active-McAfee augments the active bids and asks to include all un- 
expired but traded or priced-out offers. It proceeds as in Windowed-McAfee given this 
augmented bid set. 
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4.3 Extended Examples 

We next provide two stylized examples to demonstrate the matching performed by Chain 
using both a price-based and a competition-based matching rule. For both examples, we 
assume a maximal patience of K = 2. Moreover, while we describe when Chain determines 
that a bid or an ask trades, remember that a winning buyer is not allocated the good until 
its reported departure and a winning seller does not receive payment until its reported 
departure. 

Example 5 Consider Chain using an adaptive, price-based matching rule. The particular 
details of how prices are determined are not relevant. Assume that the prices in periods 1 
and 2 are = {8, 7} and the maximal patience is three periods. Now consider period 

3 and suppose that the order book is empty at the end of period 2 and that the bids and asks 
in Table [7] arrive in period 3. 



i 


Wi 


di 


B 

di-K 


Qi 


Pi 


SNT? 


i 


Wi 


di 


S 

di-K 


qi 


Pi 


SNT? 


bi* 


15 


4 


2 


7 


7 


N 


Sl 


-1 


4 


2 


-7 


n/a 


Y 


b2* 


10 


3 


1 


8 


8 


N 


S2* 


-3 


5 


3 


—00 


-6.5 


N 




7 


3 


1 


8 


n/a 


N 


s-i 


-4 


3 


1 


-7 


n/a 


Y 




6 


5 


3 


—00 


n/a 


N 




-5 


4 


2 


-7 


-6.5 


N 
















S5 


-10 


5 


3 


—00 


n/a 


Y 



Table 1: Bids and asks that arrive in period 3. Bids {&i, 62} niatch with asks {s2, 84} (as indicated 
with a *). Bid 63 is priced-out upon admission because qh^ > Wb^ (indicated with a strike- 
through). The admission price is and the payment made by an agent that trades is 
Pi- Column 'SNT?' indicates whether or not the bid or ask satisfies the strong no-trade 
predicate. Asks {51,35} survive into the next period because they are in SNT and have 
d^ > 3. 

Bids {bi,b2,b4^} and asks {si,..,S5} are admitted. Bid 63 is priced out because q^^ = 
m.ax{p^ ,p'^ , —00) = max(8,7, — 00) = 8 > Wb-^ = 7 by Eq. Note that 64 and S5 are 

admitted despite low bids (asks) because they have maximal patience and their admission 
prices are —00. Now, suppose that := 6.5 is defined by the matching rule and consider 
applying Match to the admitted bids and asks. 

Suppose that the bids are randomly ordered as (64,62i&i) cLnd the asks as 
(s4, S2, Sl, S3, S5). Bid 64 is picked first but priced-out because Wb^ = 6 < p^ = 6.5. Bid 
62 is tentatively accepted (wb2 = 10 > = Q.5) and then ask S4 is accepted (wg^ = —5 > 
p"^ = —6.5). Bid 62 is matched with ask S4, with payment may:{qb2,p^) = max(8, 6.5) = 8 
for 62 by Eq. ^ and payment ma,x{qs^,p'^) = max(— 00, —6.5) = —6.5 for S4. Bid hi is then 
tentatively accepted (15 > 6.5) and then matched with ask S2, which is accepted because 
—3 > —6.5. The payments are max(7, 6.5) = 7 for bi and max(— 00, — 6.5) = —6.5 for S2. 
Ask S3 expires but asks si and 55 survive and are marked i G SNT in this period because 
they were never offered the chance to match with any bid. These asks will be active in period 

4- 

Note the role of the admission price in truthfulness. Had bid bi delayed arrival until 
period 4, its admission price would be max(p^,p^, —00) = max(7, 6.5) = 7 and its payment 
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in period 4 (if it matches) at least 7. Similarly, had ask S4 delayed arrival, then its admission 
price would be max(— 7, —6.5, —00) = —6.5 and the maximal payment it can receive in period 
4 is 6.5. 

Example 6 Consider Chain using the McAfee-based matching rule with K = 3 and with 
the same bids and asks arriving in period 3. Suppose that the prices in periods 1 and 2 that 
would have been faced by a buyer are } = {8, 7} and {p\,p1} = {—7, —6} for a seller. 

These prices are determined by inserting an additional bid (with value 00) or an additional 
ask (with value 0) into the order books in each of periods 1 and 2. We will illustrate this 
for period 3. Consider now the bids and asks in period 3 in Table \M 



i 


Wi 


di 


B 

di-K 


Qi 


Pi 


SNT? 


i 


Wi 


di 


S 

di-K 


Qi 


Pi 


SNT? 


bi* 


15 


4 


2 


7 


7 


N 




-1 


4 


2 


-6 


-4 


N 


b2* 


10 


3 


1 


8 


8 


N 


S2* 


-3 


5 


3 


—00 


-4 


N 




7 


3 


1 


8 


n/a 


N 


S3 


-4 


3 


1 


-6 


n/a 


N 




6 


5 


3 


—00 


n/a 


N 


Si 


-5 


4 


2 


-6 


n/a 


N 
















S5 


-10 


5 


3 


—00 


n/a 


N 



Table 2: Bids and asks that arrive in period 3. Bids {61, 62} match with asks {si, S2} (as indicated 
with a *). Bid 63 is priced-out upon admission because qb^ > wt^. The admission price is 
Qi and the payment made by an agent that trades is pi. Column 'SNT?' indicates whether 
or not the bid or ask satisfies the strong no-trade predicate. No asks or bids survive into 
the next period. 

As before bid 63 is not admitted. The myopic matching rule now runs the (static) McAfee 
auction rule on bids {61,62,^4} CLn'd asks {si,..,S5}. Consider bids and asks in decreasing 
order of value, the last efficient trade is indexed m = 3 with wi,^ + Wg^ = 6 — 4 > 0. But 
Pm+i = (0~(~5))/2 = 2.5 (inserting a dummy bid with value as described in Section \4.^ . 
Price —pfn+i = —2.5 > S3 = —4 and this trade cannot be executed by McAfee. Instead, 
buyers {61,62} trade and face price = Wb^ = 6 and sellers {si,S2} trade and face price 
pf^ = = —4. Bids 64 and asks {53,54,55} are priced-out and do not survive into the 
next round. Ultimately, payment max(g;,^,p(^) = max(7, 6) = 7 is collected from buyer 61 
and payment max(g{,2)Pm) = niax(8, 6) = 8 is collected from buyer 62. For sellers, payment 
max(— 6, —4) = —4 and max(— 00, —4) = —4 for si and S2 respectively. 

The prices and Pg that are used in Eq. ^ to define the admission price for bids and 
asks with arrivals in periods 4 cLnd 5 are determined as follows. For the buy-side price, we 
introduce an additional bid with bid-price 00. With this the bid values considered by McAfee 
would be (00, 15, 10, 6, 0) and the ask values would be ( — 1, —3, —4, —5, —10), where a dummy 
bid with value is included on the buy-side. The last efficient pair to trade is m = A with 
6 — 5 > and Pm+i = (0 — (— 10))/2 = 5, which satisfies this bid-ask pair. Therefore the 
buy-side price, pf := 5. On the sell-side, we introduce an additional ask with ask-price 
so that the bid values considered by McAfee are (15,10,6,0) (again, with a dummy bid 
included) and the ask values are (0,-1,-3,-4,-5,-10). This time m = 3 and the last 
efficient pair to trade zs 6 — 3 > 0. Now Pm+i = (0 — (— 4))/2 = 2 and this price does not 
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satisfy S2, with —pm+i > S2 o,nd price p^+i = S2 = —3 is adopted. Therefore the sell-side 
price, p^ := —3. 

Again, we can see that bidder 1 cannot improve its price by delaying its entry until period 
4- The admission price for the bidder would be max(p^,p^) = max(7,p^) > 7 and thus its 
payment in period 4, if it matches, will be at least 70 Similarly for ask si, which would face 
admission price max{]5^,Pg} = max{— 6, —4} = —4 and can receive a payment of at most 4 
in period 4- We leave it as an exercise for the reader to verify that p^ = —4 if ask si delays 
its arrival until period 4 (in comparison, p^ = —3 when ask si is truthful). 

Because the McAfee-based pricing scheme computes a price and clears the order book 
foUowing every period in which there are at least two bids and two asks, the bid activity 
periods tend to be short in comparison to the adaptive, price-based rules where orders can 
be kept active longer when there is an asymmetry in the number of bids and asks in the 
market. In fact, one interesting artifact that occurs with adaptive, price-based matching 
rules is that the admission-price and SNT can perpetuate this kind of bid-ask asymmetry. 
Once the market has more asks than bids, SNT becomes likely for future asks, but not bids. 
Therefore, bids are much more likely than asks to be immediately priced out of the market 
by failing to meet the admission price constraint. 

5. Theoretical Analysis: Truthfulness, Uniqueness, and Justifying 
Bounded-Patience 

In this section we prove that Chain combined with a well-defined matching rule and a valid 
strong no-trade construction generates a truthful, no-deficit, feasible and individual-rational 
dynamic DA. In Section [5. 2^ we establish that uniqueness of Chain amongst dynamic DAs 
that are constructed from single-period matching rules as building blocks. In Section 15.31 
we establish the importance of the existence of a maximal bound on bidder patience by 
presenting a simple environment in which no truthful, no-deficit DA can implement even a 
single trade despite the number of efficient trades can be increased without bound. 

5.1 The Chain Mechanism is Strongly Truthful 

It will be helpful to adopt a price-based interpretation of a valid single-period matching rule. 
Given rule M^r, define an agent-independent price, Zi{H^, A^\i,uj) G M where ^* = 6* U s*, 
such that for all i, all bids 6*, all asks s*, all history H^, and all random events a; € $7. We 
have: 

(Al) Wi - Zi{H^,A^ \ i,oj) > ^ 7r„„. s\uj) = 1, and Wi - Zi{H*,A* \ i,oj) < 

(A2) payment Xmr,i(-f^*, a;) = Zi{H^ , A^ \ i,uj) li -KmrAiH^ .s^ ,uj) = 1 and 
Xmr,iiH^,b^,s*,^) = otherwise 

4. We can check that pi := 6 in this case. Suppose that bidder 1 were not present in period 3. Now consider 
introducing an additional bid with value oo so that the bids values are {oo, 10, 6, 0} (with a dummy bid) 
with ask values { — 1, —3, —4, —5, —10}. Then m — 3 and Pm+i = (0 — (— 5))/2 = 2.5, which does not 
support the trade between bid 64 and ask S3. Instead, = wt^ = 6 is adopted, and we would have 
pI := 6. Of course, this is exactly the price determined by McAfee for bid bi in period 3 when the bidder 
is truthful. 
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The interpretation is that there is an agent-independent price, Zi{H^,A* \i,u)), that is 
at least Wi when the agent loses and no greater than Wi otherwise. In particular, Zi{H^, A* \ 
= oo when i G NT*. Although an agent's price is only explicit in a matching rule when 
the agent trades, it is well known that such a price exists for any truthful, single-parameter 
mechanism; e.g., see works by Archer and Tardos (?) and Goldberg and Hartline (?)ll 
Moving forward we adopt price Zi to characterize the matching rule used as a building 
block for Chain, and assume without loss of generality properties (Al) and (A2). 

Given this, we will now establish the truthfulness of Chain by appeal to a price-based 
characterization due to Hajiaghayi et al. (?) for truthful, dynamic mechanisms. We state 
(without proof) a variant on the characterization result that holds for stochastic policies 
(vr, x) and siron^-truthfulness. The theorem that we state is also specialized to our DA 
environment. We continue to adopt a; G O to capture the realization of stochastic events 
internal to the mechanism: 

Theorem 2 (?) A dynamic DA M = {tt,x), perhaps stochastic, is strongly truthful for 
misreports limited to no early- arrivals if and only if, for every agent i, all 6i, all O^i, and 
all random events uj ^Vt, there exists a price pi{ai,di,9^i,uj) such that: 

(Bl) the price is independent of agent i's reported value 

(B2) the price is monotonic-increasing in tighter [a[,d[] C [di,di] 

(B3) trade Tii^Oi^O-i) = 1 whenever pi{ai,di, 6 -i,uj) < Wi and TTi{9i,9-i) = whenever 
Pi{ai,di,6-i,uj) > Wi, and the trade is performed for a buyer upon its departure period 
di. 

(B4) the agent's payment is Xi{9i,9-i) = pi{ai,di,9^i,uj) when TTi{9i,9^i) = 1, with 
Xi{9i,9-i) = otherwise, and the payment is made to a seller upon its departure, di. 

where random event uj is independent of the report of agent i in as much as it affects the 
price to agent i. 

Just as for the single-period, price-based characterization, the price pi{ai,di,9-i, uj) need 
not always be explicit in Chain. Rather, the theorem states that given any truthful dynamic 
DA, such as Chain, there exists a well-defined price function with these properties of value- 
independence (Bl) and arrival-departure monotonicity (B2), and such that they define the 
trade (B3) and the payment (B4). 

To establish the truthfulness of Chain, we prove that it is well-defined with respect to 
the following price function: 

Pi{di,di,9_i,u;) = max{q{di,di,9-i,uj),pi{di,di,9^i,uj)), (5) 

where 

q{ai,di,9-i,uj) = max {zi{H\ A'^ \i,uj), -oo) (6) 

iG[rf— -ft',a,-l],i^SNT' 

5. A single-parameter mechanism is one in which the private information of an agent is hmited to one 
number. This fits the single-period matching problem because the arrival and departure information 
is discarded. Moreover, although there are both buyers and sellers, the problem is effectively single- 
parameter because no buyer can usefully pretend to be a seller and vice versa. 
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and 



p{ai,di,0-i,uj) 



{ 



+00 



\i,uj) , if decision{i) = 1 
, otherwise 



(7) 



where decision{i) = indicates that i G SNT* for all t G [ai,(ij] and decision{i) = 1 
otherwise, and where t* G [ai,di] is the first period in which i ^ SNT*. We refer to this 
as the decision period. Term q{ai,di,6^i,uj) denotes the admission price, and is defined on 
periods t before the agent arrives for which i ^ SNT* had it arrived in that period. Note 
carefully that the rules of Chain are implicit in defining this price function. For instance, 
whether or not i G SNT* in some period t depends, for example, on the other bids that 
remain active in that period. 

We now establish conditions (B1)-(B4). The proofs of the technical lemmas are deferred 
until the Appendix. The following lemma is helpful and gets to the heart of the strong no- 
trade concept. 

Lemma 5 The set of active agents ( other than i) in period t in Chain is independent of 
i 's report while agent i remains active, and would be unchanged if i 's arrival is later than 
period t. 

The following result establishes properties (Bl) and (B2). 

Lemma 6 The price constructed from admission price q and post-arrival price p is value- 
independent and monotonic-increasing when the matching rule in Chain is well-defined, 
the strong no-trade construction is valid, and agent patience is bounded by K . 

Having established properties (Bl) and (B2) for price function pi(ai, d^, w), we just 
need to establish (B3) and (B4) to show truthfulness. The timing aspect of (B3) and (B4), 
which requires that the buyer receives an item and the seller receives its payment upon 
reported departure, is already clear from the definition of Chain. 

Theorem 3 The online DA Chain is strongly truthful, no-deficit, feasible and individual- 
rational when the matching rule is well-defined, the strong no-trade construction is valid, 
and agent patience is bounded by K . 

Proof: Properties (Bl) and (B2) follow from Lemma [H The timing aspects of (B3) and 
(B4) are immediate. To complete the proof, we first consider (B3). li q > Wi, then agent 
i is priced out at admission by Chain because this reflects that Zi{H^,A* \i,^) > Wi in 
some t G [dj — K, dj — 1] with i ^ SNT*, and thus the bid would lose if it arrived in that 
period (either because it could trade, but for a payment greater than its reported value, or 
because i G NT*). Also, if there is no decision period, then p = 00, which is consistent with 
Chain, because there is no bid price at which a bid will trade when i G SNT* for all periods 
t G [di,di]. Suppose now that there is a decision period t* and q < Wi. lip> Wi, then there 
should be no trade. This is the case in Chain, because the price Zi(H^* , A^* \ i,uj) in t* is 
greater than Wi and thus the agent is priced-out. li p < Wi then the bid should trade and 
indeed it does, again because the price in that period satisfies (Al) and (A2) with respect 
to the matching rule. Turning to (B4), it is immediate that the payments collected in Chain 
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are equal to price pi{ai,di,6-i,uj), because if bid i trades then pi{ai,di,9^i,oj) < wi and 
thus q < Wi and p < Wi. The admission price q{di,di,6-i,Lj) = q{di,di,9^i,uj) when q < Wi 
because price Zi is well-defined by properties (Al) and (A2). Similarly, the payment p** 
defined by the matching rule in Chain in the decision period is equal to p. 

That Chain is individual-rational and feasible follows from inspection. Chain is no- 
deficit because the payment collected from every agent (whether a buyer or a seller) is at 
least that defined by a valid matching rule in the decision period t* (it can be higher when 
the admission price is higher than this matching price), the matching rules are themselves 
no-deficit, and because the auctioneer delays making a payment to a seller until its reported 
departure but collects payment from a buyer immediately upon a match. □ 

We remark that information can be reported to bidders that are not currently partici- 
pating in the market, for instance to assist in their valuation process. If this information 
is delayed by at least the maximal patience of a bidder, so that the bid of a current bidder 
cannot infiuence the other bids and asks that it faces, then this is without any strategic 
consequences. Of course, without this constraint, or with bidders that participate in the 
market multiple times, the effect of such feedback would require careful analysis and bring 
us outside of the private values framework. 

5.2 Chain is Unique amongst Dynamic DAs that are constructed from Myopic 
Matching Rules 

In what follows, we establish that Chain is unique amongst all truthful, dynamic DAs that 
adopt well-defined, myopic matching rules as simple building blocks. For this, we define 
the class of canonical, dynamic DAs, which take a well-defined single period matching rule 
coupled with a valid strong no-trade construction, and satisfy the following requirements: 

(i) agents are active until they are matched or priced-out, 

(ii) agents participate in the single-period matching rule while active 

(iii) agents are matched if and only if they trade in the single-period matching rule. 

We think that these restrictions capture the essence of what it means to construct a 
dynamic DA from single-period matching rules. Notice that a number of design elements are 
left undefined, including the payment collected from matched bids, when to mark an active 
bid as priced-out, what rule to use upon admission, and how to use the strong no-trade 
information within the dynamic DA. In establishing a uniqueness result, we leverage the 
necessary and sufficient price-based characterization in Theorem [2l and exactly determine 
the price function pi{di,di,9-i,uj) to that defined in Eq. dH) and associated with Chain. 
The proofs for the two technical lemmas are deferred until the Appendix. 

Lemma 7 A strongly truthful, canonical dynamic DA must define price pi{di,di,9^i,u}) > 
Zi{H^*,A^* \ i,uj) where t* is the decision period for bid i (if it exists). Moreover, the bid 
must be priced-out in period t* if it is not matched. 

Lemma 8 A strongly truthful, canonical and individual-rational dynamic DA must define 
price pi{di,di,9-i,LLi) > q{ai,di,9^i,u}), and a bid with Wi < q{di,di,9^i,uj) must be priced- 
out upon admission. 
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Theorem 4 The dynamic DA algorithm Chain uniquely defines a strongly truthful, 
individual-rational auction among canonical dynamic DAs that only designate bids as priced- 
out when necessary. 

Proof: If there is no decision period, then we must have pi{ai,di,9-i, uj) = oo, by canonical 
(iii) coupled with (B3). Combining this with Lemmas [7] and [HI we have pi{ai,di,9_i,uj) > 
mayi{q{ai,di,9-i,Lo),p{ai,di,6-i,uj)). We have also established that a bid must be priced- 
out if its bid value is less than the admission price, or it fails to match in its decision 
period. Left to show is that the price is exactly as in Chain, and that a bid is admitted 
when its value Wi > q{ai,di,9_i,uj) and retained as active when it is in the strong no- 
trade set. The last two control aspects are determined once we choose a rule that "only 
designates bids as priced-out when necessary." We prefer to allow a bid to remain active 
when this does not compromise truthfulness or individual-rationality. Finally, suppose 
for contradiction that p' = pi{ai,di,9-i,uj) > iiiax{q{ai,di,9-i,uj),p{ai,di,9-i,uj)). Then 
an agent with max(q{ai,di,9-i,uj),p{ai,di,9-i,uj)) < Wi < p' would prefer to bid Wi = 
q{ai,di,9^i,uj),p{ai,di,9-i,uj)) — e and avoid winning - otherwise its payment would be 
greater than its value. □ 

5.3 Bounded Patience Is Required for Reasonable Efficiency 

Chain depends on a maximal bound on patience used to calculate the admission price faced 
by a bidder on entering the market with Eq. ([3]). To motivate this assumption about the 
existence of a maximal patience, we construct a simple environment in which the number of 
trades implemented by a truthful, no-deficit DA can be made an arbitrarily small fraction 
of the number of efficient trades with even a small number of bidders having potentially un- 
bounded patience. This illustrates that a bound on bidder patience is required for dynamic 
DAs with reasonable performance. 

In achieving this negative result, we impose the additional requirement of anonymity, 
This anonymity property is already satisfied by Chain, when coupled with matching rules 
that satisfy anonymity, as is the case with all the rules presented in Section [H In defining 
anonymity, extend the earlier definition of a dynamic DA, M = {tt,x), so that allocation 
policy TT = {vr*}*^-^ defines the probability 7r*(6'-*) G [0, 1] that agent i trades in period t 
given reports 9-^. Payment, x = {x^Y^^ , continues to define the payment by agent 

i in period t, and is a random variable when the mechanism is stochastic. 

Definition 8 (anonymity) A dynamic DA, M = (vr, x) is anonymous if allocation policy 
TT = {vr*}*^"^ defines probability of trade 7r|(0-*) in each period t that is independent of 
identity i and invariant to a permutation of {9-^\i) and if the payment x\(9-^), contingent 
on trade by agent i, is independent of identity i and invariant to a permutation of (0-* \ i). 

We now consider the following simple environment. Informally, there will be a random 
number of high-valued phases in which bids and asks have high value and there might be a 
single bidder with patience that exceeds that of the other bids and asks in the phase. These 
high-valued phases are then followed by some number, perhaps zero, of low-valued phases 
with bounded-patience bids and asks. Formally, there are > 1 high-valued phases 
(a random variable, unknown to the auction), each of duration L > 1 periods, indexed 
A; G {0, 1, . . . , T/j — 1} and each with: 
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• N or N -1 bids with type (1 + kL, {k + 1)L,vh), 

• or 1 bids with type (1 + kL, d, avn) for some mark-up parameter, a > 1 and some 
high-patience parameter, d €T, 

• asks with type (1 + kL, {k + —{vh — e)), 

followed by some number (perhaps zero) of low-valued phases, also of duration L, and 
indexed k G {T/j, . . . , oo}, with: 

• A^ or - 1 bids with type (1 + kL, {k + 1)L, vl) 

• A^ asks with type (1 + kL, {k + —{vl — e)), 

where N>1,{)<vl<vh-, and bid-spread parameter e > 0. Note that any phase can 
be the last phase, with no additional bids or asks arriving in the future. 

Definition 9 (reasonable DA) A dynamic DA is reasonable in this simple environment 
if there is some parameterization of new bids, N > 1, and periods-per-phase, L > 1, for 
which it will execute at least one trade between new bids and new asks in each phase, 
for any choice of high value vh, low value vl < vh, bid-spread e > 0, mark-up a> 1, high 
patience d. 

All of the dynamic DAs presented in Section S] can be parameterized to make them 
reasonable for a suitably large A^ > 1 and L > 1, and without the possibility of a bid with 
an unbounded patience. 

Theorem 5 No strongly truthful, individual-rational, no-deficit, feasible, anonymous dy- 
namic DA can be reasonable when a bidder's patience can be unbounded. 

Proof: Fix any A^ > 1, L > 1, and for the number of high- valued phases, T/^ > 1, set the 
departure of a high-patience agent to d = (Th + 1)L. Keep vh > vl > 0, e > 0, and a > 1 
as variables to be set within the proof. Assume a dynamic DA is reasonable, so that it 
selects at least one new bid-ask pair to trade in each phase. Consider phase k = and with 
A" — 1 agents of types {1,L,vh), N of type {1,L, —{vh — e)) and 1 agent of patient type, 
(1, (Tfi + l)L,avH)- If the patient bid deviates to {1,L,vh), then the bids are all identical, 
and with probability at least 1/A^ the bid would win by anonymity and reasonableness. 
Also, by anonymity, individual-rationality and no-deficit we have that the payment made 
by any winning bid is the same, and must be p' € [vh — e, vh] ■ (If the payment had been less 
than this, the DA would run at a deficit since the sellers require at least this much payment 
for individual-rationality.) Condition now on the case that the patient bid would win if it 
deviates and reports {1,L,vh)- Suppose the bidder is truthful, reports (1, (T^ + l)L,avH) 
but does not trade in this phase. But, if phase /c = is the last phase with new bids and 
asks, then the bid will not be able to trade in the future and for strong-truthfulness the 
DA would need to make a payment of at least avn — vh = (a — 1)vh in a later phase to 
prevent the bid having a useful deviation to (1, L, vh) and winning in phase k = {). But, if: 

Ne<{a-l)vH, (8) 
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then the DA cannot make this payment without faihng no-deficit (because A^e is an upper- 
bound on the surplus the auctioneer could extract from bidders in this phase without 
violating individual-rationality). We will later pick values of a, e and vh, to satisfy Eq. ([8]). 
So, the bid must trade when it reports (1, (Th -\- 1)L, avn), in the event that it would win 
with report {1,L,vh), as "insurance" against this being the last phase with new bids and 
asks. Moreover, it should trade for payment, p' € [vh — e, vh], to ensure an agent with true 
type {1,L,vh) cannot benefit by reporting (1, {Th + l)L,avH)- 

Now suppose that this was not the last phase with new bids and asks, and > 1. 
Now consider what would happen if the patient bid in phase A; = deviated and reported 
(1 -|- TfiL, (Tfi + 1)L,vl). As before, this bid would win with probability at least by 
anonymity and reasonableness, but now with some payment p" G [vl — e,VL]. Condition 
now on the case that the patient bid would win, both with a report of {1,L,vh) and with a 
report of (1 -|- T^L, (T^ + 1)L, vl)- When truthful, it trades in phase k = with payment at 
least Vh — e- If it had reported (1 -|- T^L, {T^ + 1)L, vl), it would trade in phase k = Th for 
payment at most vl. For strong truthfulness, the DA must make an additional payment to 
the patient agent of at least {vh — vl) — {vh — {vh — ^)) = vh — vl — ^- But, suppose that 
the high and low values are such that, 

{Th + l)Ne <VH-VL-e. (9) 

Making this payment in this case would violate no-deficit, because {Th+1)N€ is an upper- 
bound on the surplus the auctioneer can extract from bidders across all phases, including 
the current phase, without violating individual-rationality. But now we can fix any vl > 0, 
€ < Vl and choose vh > {Th + l)A^e + vl + e to satisfy Eq. Q and a > {Ne/vn) + 1 
to satisfy Eq. Thus, we have proved that no truthful dynamic DA can choose a bid- 
ask pair to trade in period k = 0. The proof can be readily extended to show a similar 
problem with choosing a bid-ask pair in any period k < Th, by considering truthful type of 
{l + kL,{Th + l)L,avH). □ 

To drive home the negative result: notice that the number of efficient trades can be 
increased without limit by choosing an arbitrarily large Th, and that no truthful, dy- 
namic DA with these properties will be able to execute even a single trade in each of 
these {0, . . . ,Th — 1} periods. Moreover, we see that only a vanishingly small fraction of 
high-patience agents is required for this negative result. The proof only requires that at 
least one patient agent is possible in all of the high-valued phases. 

6. Experimental Analysis 

In this section, we evaluate in simulation each of the CHAiN-based DAs introduced in 
Section m We measure the allocative efficiency (total value from the trades), net efficiency 
(total value discounted for the revenue that flows to the auctioneer), and revenue to the 
auctioneer. All values are normalized by the total offline value of the optimal matching. 

For comparison we also implement several other matching schemes: the truthful, surplus- 
maximizing matching algorithm presented by Blum et al. (?), an untruthful greedy matching 
algorithm using truthful bids as input to provide an upper-bound on performance, and an 
untruthful DA populated with simple adaptive agents that are modeled after the Zero- 
intelligence Plus trading algorithm that has been leveraged in the study of static DAs (?, 
?)■ 
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6.1 Experimental Set-up 

Traders arrive to the market as a Poisson stream to exchange a single commodity at discrete 
moments. This is a standard model of arrival in dynamic systems, economic or otherwise. 
Each trader, equally likely to be a buyer or seller, arrives after the previous with an expo- 
nentially distributed delay, with probability density function (pdf): 

f{x) = Ae"^^, x>0, (10) 

where A > represents the arrival intensity in agents per second. Later we present results 
as the interarrival time, j, is varied between 0.05 and 1.5; i.e., as the arrival intensity is 
varied between 20 and |. A single trial continues until at least 5,000 buyers and 5,000 sellers 
have entered the market. In our experiments we vary the maximal patience K between 2 
and 10. For the distribution on an agent's activity period (or patience, di — ai), we consider 
both a uniform distribution with pdf: 

/(x) = -i, xG[0,K], (11) 

and a truncated exponential distribution with pdf: 

/(x) = ae-°^ xG[0,i^], (12) 

where a = — ln(0.05)/i<r so that 95% of the underlying exponential distribution is less than 
the maximal patience. Both arrival time and activity duration are rounded to the nearest 
integral time period. A trader who arrives and departs during the same period is assumed 
to need an immediate trade and is active for only one period. 

Each trader's valuation represents a sample drawn at its arrival from a Tiniform distri- 
bution with spread 20% about the current mean valuation. (The value is positive for a bid 
and negative for an ask.) To simulate market volatility, we run experiments that vary the 
average valuation using Brownian motion, a common model for valuation volatility upon 
which many option pricing models are based (?). At every time period, the mean valuation 
randomly increases or decreases by a constant multiplier, e^^ , where 7 is the approximate 
volatility and varied between and 0.15 in our experiments. 

We plot the mean efficiency of 100 runs for each experiment, with the same sets of bids 
and asks used across all double auctions. All parameters of an aTiction rule arc rcoptimized 
for each market environment; e.g., we can find the optimal fixed price and the optimal 
smoothing parameters offline given the ability to sample from the market model. 

6.2 Chain Implementation 

We implement Chain for the five price-based matching rules (history-clearing, history- 
median, history-McAfee, history-EWMA, and fixed-price) and the three competition-based 
matching rules (McAfee, active-McAfee, and windowed-McAfee). 

The price-based implementations keep a fixed-size set of the most recently expired, 
traded, or priced-out offers, H*. Offers priced-out by their admission prices are inserted 
into prior to computing p*. The history-clearing metric computes a price to maximize the 
number of trades to agents represented by had they all been contemporary. The history- 
median metric chooses the price to be the median of the absolute valuation of the offers 
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in H^. The history- McAfee method computes the "McAfee price" for the scenario where 
ah agents represented by are simultaneously present. The EWMA metric computes an 
exponentially- weighted average of bids in the order that they expire, trade, or price out. 
The simulations initialize the price to the average of the mean buy and sell valuations. If 
two bids expire during the same period, they are included in arbitrary order to the moving 
average. 

None of the metrics require more than one parameter, which is optimized offline with 
access to the model of the market environment. Parameter optimization proceeds by uni- 
formly sampling the parameter range, smoothing the result by averaging each result with 
its immediate neighbors. The optimization repeats twice more over a narrower range about 
the smoothed maximum, returning the parameter that maximizes (expected) allocative effi- 
ciency. None of the price-based methods appeared to be sensitive to small (<10%) changes 
in the size of H^. With most simulations, the window size was chosen to be about 150 of- 
fers. For EWMA, the smoothing factor was usually chosen to be around 0.05 or lower. The 
windowed-McAfee matching rule, however, was extremely sensitive to window size for simu- 
lations with volatile valuations, and the search process frequently converged to suboptimal 
local maxima. 

The admission price in the price-based methods is computed by first determining 
whether MATCH would check the value of the bid against bid price if the bid had arrived in 
some earlier period t'. Rather than simulate the entire Match procedure, it is sufficient to 
determine the probability pi of this event. This is determined by checking the construction 
of the strong no-trade sets in that earlier period. If SNT* contains non-departing buyers 
(sellers), then the probability that an additional seller (buyer) would be examined is 1 and 
Pi = 1. Otherwise the probability is equal to the ratio of the number of bids (asks) ex- 
amined not included in SNT* and one more than the total number of bids (asks) present. 
Finally, with probability pi the price the agent would have faced in period t' is defined as 
(— for sellers), and otherwise it is — oo. Here, is the history-dependent price defined 
in period t'. 

The competition-based matching rules price out all non-trading bids at the end of each 
period in which trade occurs (because of the definition of strong no-trade in that context). 
The admission prices are calculated by considering the price that a bid (ask) would have 
faced in some period t' before its reported arrival. In such a period, the price for a bid (ask) 
is determined by inserting an additional bid (ask) with valuation oo (0) and applying the 
competition-based matching rule to that (counterfactual) state. From this we determine 
whether the agent would win for its reported value, and if so what price it would face. 



6.3 Optimal Offline Matching 

We use a commercial integer program solver (CPLEXH) to compute the optimal offiine 
solution, i.e. with complete knowledge about all offers received over time. In determining 
the offiine solution we enforce the constraint that a trade can only be executed if the activity 
periods of both buyer, i, and seller, j, overlap. 



(flj <dj) A (aj < di) (13) 



6. www.ilog.com 
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An integer-program formulation to maximize total value is: 

max Xij{wi + Wj) (14) 

s.t. < Xij < 1, Vj € ask 

i:{i,j)£overlap 

< ^ Xij < 1, Vi e 

jr:(j,jr)Got)eriap 

Xij G {0, 1}, Vi, j, 

where € overlap is a bid-ask pair that could potentially trade because they have 

overlapping arrival and departure intervals satisfying Eq. (fT3|) . The decision variable Xij G 
{0, 1} indicates that bid i matches with ask j. This provides the optimal, offline allocative 
efficiency. 



6.4 Greedy Online Matching 

We implement a greedy matching algorithm that immediately matches offers that yield non- 
negative budget surplus. This is a non-truthful matching rule but provides an additional 
comparison point for the efficiency of the other matching schemes. During each time period, 
the greedy matching algorithm orders active bids and asks by their valuations, exactly as 
the McAfee mechanism does, and matches offers until pairs no longer generate positive 
surplus. The algorithm's performance allows us to infer the number of offers that the 
optimal matching defers before matching and the amount of surplus lost by the McAfee 
method due to trade reduction and due to the additional constraint of admission pricing. 

6.5 Worst- Case Optimal Matching 

Blum et al. (?) derive a mechanism equivalent to our fixed-price matching mechanism, 
except that the price used is chosen from the cumulative distribution 

I^(x) = -lnf ), (15) 

ra \{r-l)WminJ 

where r is the fixed point to the equation 



and Wmin ^ and Wmax ^ are the minimum and maximum absolute valuations of all 
traders in the market. For our simulations, we give the mechanism the exact knowledge of 
the minimum and maximum absolute valuations for each schedule. Blum et al. (?) show 
that this method guarantees an expected competitive ratio of ma,'K{2,\n{wmax / Wmin)) with 
respect to the optimal offline solution in an adversarial setting. We were interested to see 
how will this performed in practice in our simulations. 



163 



Bredin, Parkes and Duong 



6.6 Strategic Open-outcry Matching: ZIP Agents 

To compare Chain with the existing hterature on continuous double auctions, we implement 

a DA that in every period sorts all active offers and matches the highest valued bids with 
the lowest valued asks so long as the match yields positive net surplus. The DA prices each 
trading pair at the mean of the pair's declared valuations. Since the trade price depends 
on a bidder's declaration, the market does not support truthful bidding strategies. We must 
therefore adopt a method to simulate the behavior of bidding agents within this simple open 
outcry market. 

For this, we randomly assign each bid to one of several "protocol agents" that each 
use a modified ZIP trading algorithm, as initially presented by Cliff and Bruten (?) and 
improved upon by Preist and van Tol (?). The ZIP algorithm is a common benchmark 
used to compare learned bidding behavior in a simple double-auction trading environment 
in which agents are present at once and adjust their bids in seeking a profitable trade. We 
adapt the ZIP algorithm for use in our dynamic environment. 

In our experiments we consider five of these protocol agents. New offers are assigned 
uniformly at random to a protocol agent, which remains persistent throughout the simula- 
tion. Each offer is associated with a patience category, k € {low, medium, high}, defined 
to evenly partition the range of possible offer patience. Each protocol agent, j, is defined 
with parameters {rj,l3j,jj) and maintains a profit margin, jjij, on each patience category k. 
Parameters {I3j,jj) control the adaptivity of the protocol agent in how it adjusts the target 
profit margin on an individual offer, with /3j ~ [7(0.1,0.2) defining the offer-level learning 
rate and 7^ ~ C/(0.2,0.8) defining the offer-level damping factor. Parameter rj € [0,1] is 
the learning rate adopted for updating the profit margins. 

The protocol agents arc trained over 10 trials and their final performance is measured 
in the 11th trial. The learning rate decreases through the training session and depends on 
the initial learning rate and the adjustment rate r^ . In period i G {1, . . . ,t^^^} of trial 
A; G {1, . . . ,T -I- 1}, where T = 10 is the number of trials used for training and t^^^ is the 
number of periods in trial k, the learning rate is defined as: 

r,:=l-(^r^ + (fc-l)r;+(^)%;j (17) 

where = (1 — rj )/{T + 1). We define := 0.7. The effect of this adjustment rule is that 
rj is initially 0.3, decreases during training, and trends to 0.0 as f — )■ t^^^ in trial k = 11. 

Within a given trial, upon assignment of a new offer i in patience category k, the protocol 
agent managing the offer initializes {iii[t), 6i{t)) := {fij, 0), where fii{t) represents the target 
profit margin for the offer and 6i{t) represents a profit-margin correction term. The target 
profit margin and the profit margin correction term are adjusted for offer i in subsequent 
periods while the bid remains active. 

The target profit margin is used to define a bid price for the offer in each period while 
it remains active: 

Wi{t):=Wi{l-\-fii(t)). (18) 
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At the end of a period in which an offer matches or simply expires, the profit margin 
for its patience category is updated as: 

fi'; ■.= {l-r,)^i'; + rJ^,i{t), (19) 

where the amount of adaptivity depends on the learning rate rj. Because the profit margin 
on an offer decays over its lifetime, this update adjusts towards a small profit margin if 
the offer expires or took many periods to trade, and a larger profit margin otherwise. The 
long-term learning of a protocol agent occurs through the profit margin assigned to each 
patience category. 

At the start of a period each protocol agent also computes target prices for bids and 
asks in each patience category. These are used to drive an adjustment in the target profit 
margin for each active bid and ask. Target prices T^{t) and Tg{t) are computed as: 



r (1+7?) max {wi{t-l)}+C 

^bW- \ {wi{t-l)}-i 

\ iGBfe(t-l) 

and, 

( (1+r?) max {wi{t - I)} + ^ 

k(.^ .= J iG5'=(t-l) 

1 (1-r?) max {wAt - I)} - ^ 



, if > max {wi{t — 1)} + max {wi{t — 1)} 
ieS{t-l) i(zBk[t-l) 

, otherwise 

(20) 

, if > max {wi{t — 1)} + max {wi{t — 1)} 

iG-B(i-l) i£S^{t-l) 

, otherwise 

(21) 



where ^, ~ ?7(0, 0.05). Here, B{t — 1) and S{t — 1) denote the set of active bids and asks 
in the market in period t — 1 (defined before market clearing), and B^{t — 1) and S^{t — 1) 
denote the restrictions to patience category k. The target price on a bid in category k 
is set to something slightly greater than the most competitive bid in the previous round 
when that bid could not trade, and slightly less otherwise. Similarly for the target price on 
asks, where these prices are negative, so that increasing the target price makes an ask more 
competitive. 

Target prices are used to adjust the target profit margin at the start of each period on 
all active offers that arrived in some earlier period, where the influence of target prices is 
through the profit-margin correction term: 

,m ■■= - 1, (22) 

Wi 

and the profit-margin correction term, 5j(t), is defined in terms of the target price Tf{t) 
(equal to T^{t) if i is a bid and Tg{t) otherwise) as, 

m := ^j6i{t - 1) + (1 - l,)Pj{THt) - Wi{t - 1)), (23) 

where and Pj are the offer-level learning rates and damping factor. The value Wi and the 
"-1" term in Eq. (|22p provide normalization. Eq. (j23p is the Widrow-Hoff (?) rule, designed 
to minimize the least mean square error in the profit margin and adopted here to mimic 
earlier ZIP designs. 
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6.7 Experimental Results 

Our experimental results show that market conditions drive DA choice. We compare al- 
locative efficiency, revenue, and net efficiency. All results are averaged over 100 trials. 
In experiments we found only minimal qualitative differences between the use of the two 
patience distributions. The uniform patience distribution provides a slight increase in ef- 
ficiency over result using exponential patience, caused by a larger proportion of patient 
agents which relaxes somewhat the admission-price constraint in Eq. ([3]). For this reason 
we choose to report only results for the uniform patience distribution. 

While the performance of all methods are summarized in Table [3l we omit the perfor- 
mance of some markets from the plots to keep the presentation of results as clear as possible. 
We do not plot the price-based results for the median- or clearing-based prices because the 
performance was typically around that of the performance of Chain instantiated on the 
history-EWMA price. We do not plot the windowed-McAfee results because of inconsis- 
tent performance, and in most cases, upon manual inspection, it was optimal to choose the 
smallest possible window size, i.e. including only active bids and making it equivalent to 
active-McAfee. 

Our plots also leave out the performance of the Blum et al. (?) worst-case optimal 
matching scheme because it was dominated by the fixed-price Chain instantiation and in 
many cases failed to yield any substantial surplus. We note here that the modeling assump- 
tion made by Blum et al. (?) is quite different than that in our work: they worry about 
performance in an adversarial environment while we consider probabilistic environments. 
Our fixed-price Chain mechanism operates essentially identically to the surplus-maximizing 
scheme of Blum et al. (?), except that Chain can also use additional statistical information 
to set the ideal price, rather than drawing the price from a distribution that is used to 
guarantee worst-case performance against an adversary. We defer the results for the Blum 
et al. (?) scheme to Table El 

Figures [HHS] plot results from two sets of experiments, one for high-patience/low- volatility 
and one for low-patience/high- volatility, as we vary the inter-arrival time (and thus the 
arrival intensity), volatility and maximal patience. All plots are for allocative efficiency 
except Figure [H where we consider net efficiency. Active- McAfee is included on Figure O 
but not on any other plots because it did not improve upon the McAfee performance in 
the other environments. To emphasize: the results for greedy provide an upper-bound on 
the best possible performance because this is a non-truthful algorithm, simulated here with 
truthful inputs. 

In Figure [5] (left) we see that from within the truthful DAs, the McAfee-based DA has 
the best efficiency for medium to low arrival intensities. There also is a general decrease 
in performance, relative to the optimal offline solution, as the arrival intensity falls. This 
trend, also observed with the greedy (non-truthful) DA, occurs because the Chain scheme 
is myopic in that it matches as soon as the static DA building block finds a match, while 
it is better to be less myopic when arrival intensity is low. The McAfee-based DAs are less 
sensitive to this than other methods because they can aggressively update prices using the 
active traders. The price-based DAs experience inefficiencies due to the lag in price updates 
because they use only expired, traded, and priced-out offers to calculate prices. 
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Figure 5: AUocative efficiency vs. inter-arrival time (1 / intensity) for several DAs. The left 
plot shows high-patience, low- volatility simulations, whereas the right plots results from 
low-patience, high-volatility runs. Both sets of experiments use uniform patience distri- 
butions. 
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Figure 6: Net efficiency vs. inter-arrival time (1 / intensity) for several DAs. The left plot 
shows high-patience, low-volatility simulations, whereas the right plots results from low- 
patience, high-volatility runs. Both sets of experiments use uniform patience distribu- 
tions. 



167 



Bredin, Parkes and Duong 



For very high arrival intensity we see Active-McAfee dominates McAfee. Active-McAfee 
smooths the price, which helps to mitigate the impact of fluctuations in cost on the admission 
price via Eq. ([3]) in return for less responsiveness. This is helpful in "well-behaved" markets 
with high arrival intensity and low volatility but was not helpful in most environments we 
studied, where the additional responsiveness provided by the (vanilla) McAfee scheme paid 
off. 

The ZIP market also has good performance in this high-patience/low- volatility environ- 
ment. The reason is simple: this is an easy environment for simple learning agents, and the 
agents quickly learn to be truthful. We emphasize that these ZIP market results should be 
treated with caution and are certainly optimistic. This is because the ZIP agents are not 
programmed to consider timing-based manipulations. The effect in this environment is that 
the ZIP market tends to operate as if a truthful market, but without the cost of imposing 
truthfulness explicitly via market-clearing rules. By comparison the Chain auctions are 
fully strategyproof, to both value and temporal manipulations. 

Compare now with Figure [5] (right), which is for low patience and high volatility. Now we 
see that McAfee dominates across the range of arrival intensities. Moreover, the performance 
of ZIP is now quite poor because the agents do not have enough time to adjust their bids 
(patience is low) and high volatility makes this a more difficult environment. With volatile 
valuations, the possibility of valuation swings leaves open the possibility of larger profits, 
luring agents to set wider profit margins, but only after the market changes. The ZIP agents 
also have fewer concurrent competitive offers to use in setting useful price targets during 
learning. As we might expect, high volatility also negatively impacts the efficiency of the 
fixed-price scheme. 

In Figure [6] we see that the net efficiency trends are qualitatively similar except that the 
competition-based DAs such as McAfee fare less well in comparison with the price-based 
DAs. The auctioneer accrues more revenue for competition-based matching rules such as 
McAfee because they often generate buy and sell prices with a spread. Together with 
the competition-based schemes being intrinsically more dynamic, this drives an increased 
price spread in Chain via the admission price constraints. In Figure [6] (left) we see that 
the fixed-price scheme performs well for high arrival intensity while EWMA dominates for 
intermediate arrival intensities. The McAfee scheme is still dominant for lower patience and 
higher volatility (Figure [6l right). 

To reinforce these observations, in Table [3] we present the the net efficiency, allocative 
efficiency and (normalized) revenue across all arrival intensities (i.e. inter-arrival time from 
0.05 to 1.5) and for both low and high volatility trials. All five price-based methods, all 
three competition-based methods, and all three comparison methods are included. We 
highlight the best performing competition-based method, price-based method, as well as 
the performance of the ZIP market (skipping over the non-truthful, greedy algorithm). We 
omit information about the mean standard error for each measurement because in no case 
did this error exceed a tenth of a percent of the mean optimal surplus. From within the 
truthful DAs, we see that the McAfee-based scheme dominates overall for both allocative 
and net efficiency and both low and high volatility, although EWMA competes with McAfee 
for net efficiency in low volatility markets. Notice also the good performance of the ZIP- 
based market (with the aforementioned caveat about the restricted strategy space) at low 
volatilities. 
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Table 3: Net efficiency, allocative efficiency and auctioneer revenue (all normalized by the optimal 
value from trade), averaged across all arrival intensities (0.05-1.5) and for low and high 
value volatility. The best performing competition-based, price-based and 'other' (ignoring 
greedy, which is not truthful) results are highlighted. 
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Figure 7: Allocative efhciency vs. volatility for several DAs for a fairly low arrival intensity. The 
left plot is for large maximal patience and the right plot is for small maximal patience. 
Both sets of experiments use uniform patience distributions. 



Figure [7| plots allocative efficiency versus volatility for high patience (left) and low 
patience (right) and for fairly low arrival intensity. Higher volatility hurts all methods - 
especially the ZIP agents, which struggle to learn appropriate profit and price targets, 
probably due to few opportunities to update prices for every individual offer. The McAfee 
scheme fairs very well, showing good robustness for both large patience and small patience 
environments. The fixed-price scheme has the best performance when there is zero volatility 
but its efficiency falls off extremely quickly as volatility increases. 
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Figure 8: AUocative efficiency vs. maximal patience for several DAs and fairly low arrival intensity. 

The left plot is for low volatility and the right plot is for high volatility. Both sets of 
experiments use uniform patience distributions. 



We also consider the effect of varying maximal patience. This is shown in Figure [HI 
with low volatility (left) and high volatility (right). Again, the McAfee scheme is the best 
of the truthful DAs based on Chain. We also see that the performance of ZIP improves 
as patience increases due to more opportunities for learning. Perversely, a larger patience 
can negatively affect the truthful DAs. In part this is simply because the performance of 
greedy online schemes, relative to the offline optimal, decreases as patience increases and 
the offline optimal matching is able to draw more benefit from its lack of myopia. 

We also suspected another culprit, however. The possibility of the presence of patient 
agents requires the truthful DAs to include additional terms in the max operator in Eq. ([3]) 
to prevent manipulations, leading to higher admission prices and less admitted offers. To 
better understand this effect we experimented with delayed market clearing in the McAfee 
scheme, where the market matches agents only every r-th period (the "clearing duration"). 
The idea is to make a tradeoff between using fewer admission prices and the possibility that 
we will miss the opportunity to match some impatient offers. 

Figure [9] shows allocative efficiency when the matching mechanism clears less frequently 
and for different maximal patience, K. Figure [9] (left) is for low volatility. There we 
see that the best clearing duration is roughly 1, 2, 3 and 4 for maximal patience of IT G 
{4, 6, 8, 10} and that by optimizing the clearing duration the performance of McAfee remains 
approximately constant as maximal patience increases. In Figure [9] (right) we consider the 
effect in a high volatility environment, with these results averaged over 500 trials because the 
performance of the DA has higher variance. We see a qualitatively similar trend, although 
higher maximal patience now hurts overall and cannot be fully compensated for by tuning 
the clearing duration. 



7. Related Work 

Static two-sided market problems have been widely studied (?, ?, ?, ?, ?). In a classic result, 
Myerson and Satterthwaite proved that it is impossible to achieve efficiency with voluntary 
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Figure 9: Allocative efficiency vs. clearing duration in the McAfee-based Chain auction for fairly 
low arrival intensity and as maximal patience is varied from 4 to 10. The left plot is 
for low volatility and the right plot is for high volatility. Both sets of experiments use 
uniform patience distributions. 



participation and without running a deficit, even relaxing dominant-strategy equilibrium to 
a Bayesian-Nash equilibrium. Some truthful DAs are known for static problems (?, ?, ?, ?). 
For instance, McAfee introduced a DA that sometimes forfeits trade in return for achieving 
truthfulness. McAfee's auction achieves asymptotic efficiency as the number of buyers and 
sellers increases. Huang et al. extend McAfee's mechanism to handle agents exchanging 
multiple units of a single commodity. Babaioff and colleagues have considered extensions 
of this work to supply-chain and spatially distributed markets. 

Our problem is also similar to a traditional continuous double auction (CDA), where 
buyers and sellers may at any time submit offers to a market that pairs an offer as soon 
as a matching offer is submitted. Early work considered market efficiency of CDAs with 
human experiments in labs (?), while recent work investigates the use of software agents 
to execute trades (?, ?, ?, ?). While these markets have no dominant strategy equilibria, 
populations of software trading agents can learn to extract virtually all available surplus, 
and even simple automated trading strategies outperform human traders (?). However, 
these studies of CDAs assume that all traders share a known deadline by which trades must 
be executed. This is quite different from our setting, in which we have dynamic arrival and 
departure. 

Truthful one-sided online auctions, in which agents arrive and depart across time, have 
received some recent attention (?, ?, ?, ?, ?). We adopt and extend the monotonicity-based 
truthful characterization in the work of Hajiaghayi et al. (?) in developing our framework 
for truthful DAs. Our model of DAs must also address some of the same constraints on 
timing that occur in Porter, Hajiaghayi, and Lavi and Nisan's work. In these previous 
works, the items were reusable or expiring and could only be allocated in particular periods. 
In our work we provide limited allowance to the match-maker, allowing it to hold onto a 
seller's item until a matched buyer is ready to depart (perhaps after the seller has departed). 
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The closest work in the hterature is due to Blum et al. (?), who present online matching 
algorithms for the same dynamic DA model. The main focus in their paper is on the 
design of matching algorithms with good worst-case performance in an adversarial setting, 
i.e. within the framework of competitive analysis. Issues related to incentive compatibility 
receive less attention. One way in which their work is more general is that they also study 
goals of profit and maximizing the number of trades, in addition to the goal of maximizing 
social welfare that we consider in our work. However, the only algorithmic result that they 
present that is truthful in our model (where agents can misreport arrival and departure) is 
for the goal of social welfare. The DA that they describe is an instance of Chain in which 
a fixed price is drawn from a distribution at the start of time, and used as the matching 
price in every period. Perhaps unsurprisingly, given their worst-case approach, we observe 
that their auction performs significantly worse than Chain defined for a fixed price that is 
picked to optimize welfare given distributional information about the domain. 

8. Conclusions 

We presented a general framework to construct algorithms to match buyers with sellers in 
online markets where both valuation and activity-period information are private to agents. 
These algorithms guarantee truthfiil dominant strategies by first imposing a minimum ad- 
mission price for each offer and then pricing and pairing the offer at the first opportunity. 
At the heart of the Chain framework lies a pricing algorithm that must for each offer either 
determine a price independent of any information describing the offer or choose to discard 
the offer. The pricing algorithm should be chosen to match market conditions. We present 
several examples of suitable pricing schemes, including fixed-price, moving-average, and 
McAfee-based schemes. 

More often than not, we find that the competition-based scheme that employs a McAfee- 
based rule to truthfully price the market delivers the best allocative efficiency. For excep- 
tionally low volatility and high arrival intensity, we find that adaptive price-based schemes 
such as an exponentially- weighted moving average (EWMA) and even fixed price schemes 
perform well. We see qualitatively similar results for net efficiency, where the revenue that 
accrues to the auctioneer is discounted, albeit that the price-based rules such as EWMA 
have improved performance because they have no price spread. The observations are rooted 
in simulations comparing the market efficiency under each mechanism with the optimal of- 
fline solution. 

Additionally, we compare the efficiency of our truthful markets with a fixed-price worst- 
case optimal scheme presented by Blum et al. (?), a market of strategic agents using a variant 
on the ZIP price update algorithm developed by Cliff and Bruten (?) for continuous double 
auctions, and a non-truthful, greedy matching algorithm to provide an upper-bound on 
performance. The best of our schemes yield around 33% net efficiency in low volatility, high 
patience environments and 40% net efficiency in high volatility, low patience environments, 
while the greedy bound suggests that as much as 86% efficiency is possible with non-strategic 
agents. We note that the Blum et al. scheme, designed for adversarial settings, fairs poorly 
in our simulations (< 10%). 

One can argue, we think convincingly, that truthfulness brings benefits in itself in that 
it avoids the waste of costly counterspeculation and promotes fairness in markets (?, ?). 
On the other hand, it is certainly of interest that the gap between the efficiency of greedy 



172 



Chain: An Online Double Auction 



matching with non-truthful matching and that of our truthful auctions is so large. Here, we 
observe that the ZIP-populated (non-truthful) markets achieve around 82% efficiency in low 
volatility environments but collapse to around 23% efficiency in high volatility environments. 
Based on this, one might conjecture that designing for truthfulness is especially important 
in badly behaved, highly volatile ( "thin" ) environments but less important in well behaved, 
less volatile ("thick") environments. 

Formalizing this tradeoff between providing absolute truthfulness and approximate 
truthfulness, and while considering the nature of the environment, is an interesting di- 
rection for future work (see paper by ?). Given that reporting of market statistics can be 
incorporated within our framework (see Section 15. 1|) , and given that markets also play a 
role in information aggregation and value discovery, future research should also consider 
this additional aspect of market design. Perhaps there is an interesting tradeoff between 
efficiency, truthful value revelation, and the process of information aggregation. 

While the general Chain framework achieves good efficiency, further tuning seems pos- 
sible. One direction is to adopt a meta-pricing scheme that chooses, or blends, prices from 
competing algorithms. Another direction is to consider richer temporal models; e.g., the 
value of goods to agents might decay or grow over time to better account for the time 
value of assets. A richer temporal model might also consider the possibility of agents or the 
match-maker taking short positions (including short-term cash deficits) to increase trade. 
It is also interesting to extend our work to markets with non-identical goods and more com- 
plex valuation models such as bundle trades (?, ?, ?), and to dynamic matching problems 
without prices, such as an online variation of the classic "marriage" problem (?). 
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Appendix: Proofs 

Lemma 1 Procedure Match defines a valid strong no-trade construction. 

Proof: In all cases, SNT* C NT*. The set NT* is correctly constructed: equal to all 
remaining bids 6* when (j = 0) in Case I, all remaining bids s* when (i = 0) in Case II, 
and all remaining bids and asks otherwise. In each case, no bid (or ask) in NT* could have 
traded at any price because there was no available bid or ask on the opposite of the market 
given its order. 
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In verifying strong no-trade (SNT) conditions (a) and (b), we proceed by case analysis. 
Case I. {i ^ 0) and {j = 0). NT* := 6*. 

(I-l) VA; G s* • (4 = t) and SNT* := 6*. For SNT-a, consider / G NT* with di > t. If 
I deviates and i changes but we remain in Case I then NT* is unchanged and still 
contains I. If I deviates and i — >■ then, we go to Case III and SNT* := 6* Us* and still 
contains I. For SNT-b, consider I G SNT* that deviates with di > t. Again, either we 
remain in this case and SNT* is unchanged or i — > and we go to Case III. But now 
SNT* still contains all 6* and is therefore unchanged for all agents with d^ > t. 

(1-2) Buyer A: G 6* with 4 = * and bk > p* and SNT* := 6*. For SNT-a, consider I G NT* 

with di > t. We remain in this case for any deviation by buyer I because buyer k 
will ensure i ^ 0, and so SNT* remains unchanged and still contains I. For SNT-b, if 
/ G SNT* with di> t deviates we again remain in this case and SNT* is unchanged. 

(1-3) Some seller with dk > t and no buyer with d^i = t willing to accept the price. 
SNT* := 6* \ checkedfi. For SNT-a, consider I G NT* with di > t. First, suppose 
I G checked^ and i ^ I. If Z deviates but still has di > t, then even if i := I then we 
remain in this case and I does not enter SNT*. Second, suppose I G checked^ and 
{i = I). If Z deviates but still has di > t, then even if (i = 0) and (j = 0), wc go 
to Case III and SNT* = and / does not enter SNT*. Third, suppose / ^ checked^ 
and di > t. Deviating while di > t has no effect and we remain in this case and / 
remains m SNT*. For SNT-b, consider I G SNT* with di > t, i.e. with I ^ checkeds. 
If I deviates but di > t, then this has no effect and we remain in this case and SNT* 
remains unchanged. 

Case II. (j / 0) and (i = 0). NT* := s*. Symmetric with Case I. 
Case in. {i = 0) and [j = 0). NT* := 6* U s*. 

(III-l) \/k ■ (4 = t) but 3k' G s* • (4' > t) and SNT* := 6* U s*. For SNT-a, consider 
I G NT* with di > t. This must be an ask. If I deviates but we remain in this case, 
then I remains in SNT*. If j := I, then we go to Case II and SNT* := s* and I remains 
in SNT*. For SNT-b, consider / G SNT* with di > t, which must be an ask. If I 
deviates but we remain in this case, SNT* is unchanged. If / deviates and j := I, then 
we go to Case II, SNT* := s*, and buyers 6* are removed from SNT*. But this is OK 
because all buyers depart in period t anyway. 

(III-2) V/c G s^cdot{dk = t) but 3k' G 6* • (4' > t) and SNT* := 6* U s*. Symmetric to Case 
III-l. 

(III-3) Vk eh* ■ (4 = t) and V/c G s^cdot{dk = t). SNT* := 6* U s*. SNT-a and SNT-b are 
trivially met because no bids or asks have departure past the current period. 

(III-4) 3k G 6* • (4 > t) and 3k' G s* • (4' > t) and SNT* := 0. For SNT-a, consider I G NT* 
with di > t. Assume that I is a bid. If I deviates and di > t and i = then we remain 
in this case and I is not in SNT*. If / deviates and di > t but i := I, then we go to 
Case I and we are necessarily in Sub-case (I-a) because di > t and there can be no 
other bid willing to accept the price (else i / in the first place). Thus, we would 
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have SNT* := 6* \ checked^ and I would not be in SNT*. For SNT-b, this is triviahy 
satisfied because there are no agents / € SNT*. 

□ 

Lemma 5 The set of active agents ( other than i) in period t in Chain is independent of 
i 's report while agent i remains active, and would be unchanged if i 's arrival is later than 
period t. 

Proof: Fix some arrival period cij. Show for any ai > ai, the set of active agents in period 
t > cii while i is active is the same as A* without agent i's arrival until some a[ > t. Proceed 
by induction on the number of periods that t is after Oj. For period t = hi this is trivial. 
Now consider some period + r, for some r > 1 and assume the inductive hypothesis for 
dj + r — 1. Since i is still active then, i G SNT* for i' = + r — 1, and therefore the other 
agents in SNT* that survive into this period are independent of agent z's report by strong 
no-trade condition (b). This completes the proof. □ 

Lemma 6 The price constructed from admission price q and post- arrival price p is value- 
independent and monotonic-increasing when the matching rule in Chain is well-defined, 
the strong no-trade construction is valid, and agent patience is bounded by K . 

Proof: First fix ai,di and 9^i. To show value-independence (Bl), first note that q is 
value-independent, since whether or not i E SNT* in some pre-arrival period t is value- 
independent by strong no-trade condition (a) and price Zi{H^,A^ \ i,uj) in such a period 
is agent-independent by definition. Term p is also value-independent: the decision period 
t* to agent i, if any, is independent of Wi since the other agents that remain active are 
independent of agent i while it is active by Lemma [5l and whether or not i € SNT* is 
value-independent by strong no-trade (a); and the price in t* is value-independent when 
the set of other active agents are value- independent. 

Now fix 6-i and show the price is monotonically-increasing in a tighter arrival-departure 
interval (B2). First note that q is monotonic-increasing in [dj, di] C [oj, di] because an earlier 
di and later dj increases the domain t G [dj — K, cii — 1] on which q is defined. Fix some 
Ui > Oi. Argue the price increases with earlier d[ < di, for any di > hi. To see this, note that 
either di < t* and so pi{ai,d^,9^i, w) = oo for all d'^ < di, or dj > t* and the price is constant 
until di < t* at which point it becomes oo. Fix some dj > a^. Argue the price increases with 
later > dj, where en > di — K. First, while a- < t*, then p is unchanged by Lemma O 
The interesting case is when a- > i*, especially when q{ai,di,6-i,uj) < p{di,di,9-i,uj). 
By reporting a later arrival, the agent can delay its decision period and perhaps hope to 
achieve a lower price. But, note that in this case t* € [di — K,a[ — 1] since di — K < en and 
t* G [dj, a[ — 1] and so q{a[, di, 6-i,uj) > p{H** , A^* \ i, w) because q now includes the price 
in period t* since i ^ SNT** in that pre-arrival period by Lemma O Overall, we see that 
although p may decrease, max(q,p) cannot decrease. □ 

Lemma 7 A strongly truthful, canonical dynamic DA must define price pi{ai,di,6-i,uj) > 
Zi{H^*,A^* \ i,uj) where t* is the decision period for bid i (if it exists). Moreover, the bid 
must be priced-out in period t* if it is not matched. 

Proof: (a) First, suppose Zi{H^*,A^* \i,u:) > Wi but bid i is not priced-out and instead 
survives as an active bid into the next period. But with i ^ SNT**, the set of active bids in 
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period t * +1 need not be independent of agent i's bid and the price Zi{H^*^^, j\t*+i \^ 
need not be agent-independent. Yet, canonical rule (iii) requires that this price be used 
to determine whether or not the agent matches, and so the dynamic DA need not be 
truthful, (b) Now assume for contradiction that pi{ai,di,6-i,iLi) < Zi{H^*,A^* \ i,uj). First, 
if Zi{H^* ,A^* \ i,uj) < oo, then an agent with value pi{ai,di,9^i, uj) < Wi < Zi{H** , A** \i, w) 
will report Wi = Zi{H^* , A^* \i,uj) + e and trade now for a final payment less than its true 
value (whereas it would be priced-out if it reported its true value). If Zi{H^* ,A^* \i, uj) = oo, 
then pi{ai,di, 9-i,u) < Zi{H^* , A^*\i, uj) implies that some bids will survive this period even 
though they are priced-out by the matching rule and not in the strong no-trade set. This 
compromises the truthfulness of the dynamic DA, as discussed in part (a). □ 

Lemma 8 A strongly truthful, canonical and individual-rational dynamic DA must define 
price pi{di,di,6-i,Lo) > q{di,di,6-i,uj), and a bid with Wi < q{di,di,6^i,uj) must be priced- 
out upon admission. 

Proof: Suppose di < di + K so that [di — K,di — 1] is non-empty. For di = di + K — 1, 
when t = di — K is a decision period (and i ^ SNT*), we have 

Pi{di,di,9-i,uj) > pi{di - K,di,9^i,uj) > Zi{H\A^ \ i,^^), (24) 

where the first inequality is by monotonicity (B2) and the second follows from Lemma [7| 
since di — K is a decision period, and would remain one with report 9'^ = {di — K,di,w'j) 
by LemmaO This establishes pi{di, di, O-i^u) > q{di,di,9-i, uj) for di = di + K — 1. When 
di = di + K — 2, then we need Eq. (p^ . and also when t = di — K + 1 is a decision period 
(and i ^ SNT*) we have, 

Pi{di,di,9-i,uj) > pi{di - K + l,di,9-i,uj) > Zi{H\A^ \ i,uj), (25) 

by the same reasoning as above. This generalizes to dj = Oi + K — r for r G {2, . . . , 
to establish pi{di,di,9-i,u}) > q{di,di,9-i,uj) for the general case. To see the bid must be 
priced-out when Wi < q{di, di,9-i, uj), note that if it were to remain active it could match in 
the matching rule and by canonical (iii) need to trade, and thus fail individual-rationality 
since the payment collected would be more than the value. □ 
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